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PREFACE. 

Thb  usual  point  of  view  in  the  study  of  mechanics  is  that 
where  the  attention  is  mainly  directed  to  the  changes  which 
take  place  in  the  course  of  time  in  a  given  system.  The  prin- 
cipal problem  is  the  determination  of  the  condition  of  the 
system  with  respect  to  configuration  and  velocities  at  any 
required  time,  when  its  condition  in  these  respects  has  been 
given  for  some  one  time,  and  the  fundamental  equations  are 
those  which  express  the  changes  continually  taking  place  in 
the  system.  Inquiries  of  this  kind  are  often  simplified  by 
taking  into  consideration  conditions  of  the  system  other  than 
those  through  which  it  actually  passes  or  is  supposed  to  pass, 
but  our  attention  is  not  usually  carried  beyond  conditions 
differing  infinitesimally  from  those  which  are  regarded  as 
actual. 

For  some  purposes,  however,  it  is  desirable  to  take  a  broader 
view  of  the  subject  We  may  imagine  a  great  number  of 
systems  of  the  same  nature,  but  differing  in  the  configura- 
tions and  velocities  which  they  have  at  a  given  instant,  and 
differing  not  merely  infinitesimally,  but  it  may  be  so  as  to 
embrace  every  conceivable  combination  of  configuration  and 
velocities.  And  here  we  may  set  the  problem,  not  to  follow 
a  particular  system  through  its  succession  of  configurations, 
but  to  determine  how  the  whole  number  of  systems  will  be 
distributed  among  the  various  conceivable  configurations  and 
velocities  at  any  required  time,  when  the  distribution  has 
been  given  for  some  one  time.  The  fundamental  equation 
for  this  inquiry  is  that  which  gives  the  rate  of  change  of  the 
number  of  systems  which  fall  within  any  infinitesimal  limits 
of  configuration  and  velocity. 
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Such  inquiries  have  been  called  by  Maxwell  statistical 
They  belong  to  a  branch  of  mechanics  which  owes  its  origin  to 
the  desire  to  explain  the  laws  of  thermodynamics  on  mechan- 
ical principles,  and  of  which  Clausius,  Maxwell,  and  Boltz- 
mann  are  to  be  regarded  as  the  principal  founders.  The  first 
inquiries  in  this  field  were  indeed  somewhat  narrower  in  their 
scope  than  that  which  has  been  mentioned,  being  applied  to 
the  particles  of  a  system,  rather  than  to  independent  systems. 
Statistical  inquiries  were  next  directed  to  the  phases  (or  con- 
ditions with  respect  to  configuration  and  velocity)  which 
succeed  one  another  in  a  given  system  in  the  course  of  time. 
The  explicit  consideration  of  a  great  number  of  systems  and 
their  distribution  in  phase,  and  of  the  permanence  or  alteration 
of  this  distribution  in  the  course  of  time  is  perhaps  first  found 
in  Boltzmann's  paper  on  the  "  Zusammenhang  zwischen  den 
Satzen  iiber  das  Verhalten  mehratomiger  Gasmolekiile  mit 
Jacobi's  Princip  des  letzten  Multiplicators  "  (1871). 

But  although,  as  a  matter  of  history,  statistical  mechanics 
owes  its  origin  to  investigations  in  thermodynamics,  it  seems 
eminently  worthy  of  an  independent  development,  both  on 
account  of  the  elegance  and  simplicity  of  its  principles,  and 
because  it  yields  new  results  and  places  old  truths  in  a  new 
light  in  departments  quite  outside  of  thermodynamics.  More- 
over, the  separate  study  of  this  branch  of  mechanics  seems  to 
afford  the  best  foundation  for  the  study  of  rational  thermody- 
namics and  molecular  mechanics. 

The  laws  of  thermodynamics,  as  empirically  determined, 
express  the  approximate  and  probable  behavior  of  systems  of 
a  great  number  of  particles,  or,  more  precisely,  they  express 
the  laws  of  mechanics  for  such  systems  as  they  appear  to 
beings  who  have  not  the  fineness  of  perception  to  enable 
them  to  appreciate  quantities  of  the  order  of  magnitude  of 
those  which  relate  to  single  particles,  and  who  cannot  repeat 
their  experiments  often  enough  to  obtain  any  but  the  most 
probable  results.  The  laws  of  statistical  mechanics  apply  to 
conservative  systems  of  any  number  of  degrees  of  freedom, 
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and  are  exact.  This  does  not  make  them  more  difficult  to 
establish  than  the  approximate  laws  for  systems  of  a  great 
many  degrees  of  freedom,  or  for  limited  classes  of  such 
systems.  The  reverse  is  rather  the  case,  for  our  attention  is 
not  diverted  from  what  is  essential  by  the  peculiarities  of  the 
system  considered,  and  we  are  not  obliged  to  satisfy  ourselves 
that  the  effect  of  the  quantities  and  circumstances  neglected 
will  be  negligible  in  the  result  The  laws  of  thermodynamics 
may  be  easily  obtained  from  the  principles  of  statistical  me- 
chanics, of  which  they  are  the  incomplete  expression*  but 
tiiejr'make  a  somewhat  blind  guide  in  our  search  for  those 
laws.  Tliis  is  perhaps  the  principal  cause  of  the  slow  progress 
of  rational  thermodynamics,  as  contrasted  with  the  rapid  de- 
duction of  the  consequences  of  its  laws  as  empirically  estab- 
lished. To  this  must  be  added  that  the  rational  foundation 
of  thermodynamics  lay  in  a  branch  of  mechanics  of  which 
the  fundamental  notions  and  principles,  and  the  characteristic 
operations,  were  alike  unfamiliar  to  students  of  mechanics. 

We  may  therefore  confidently  believe  that  nothing  will 
more  conduce  to  the  clear  apprehension  of  the  relation  of 
thermodynamics  to  rational  mechanics,  and  to  the  interpreta- 
tion of  observed  phenomena  with  reference  to  their  evidence 
respecting  the  molecular  constitution  of  bodies,  than  the 
study  of  the  fundamental  notions  and  principles  of  that  de- 
partment of  mechanics  to  which  thermodynamics-  is  especially 
related. 

Moreover,  we  avoid  the  gravest  difficulties  when,  giving  up 
the  attempt  to  frame  hypotheses  concerning  the  constitution 
of  material  bodies,  we  pursue  statistical  inquiries  as  a  branch 
of  rational  mechanics.  In  the  present  state  of  science,  it 
seems  hardly  possible  to  frame  a  dynamic  theory  of  molecular 
action  which  shall  embrace  the  phenomena  of  thermody- 
namics, of  radiation,  and  of  the  electrical  manifestations 
which  accompany  the  union  of  atoms.  Yet  any  theory  is 
obviously  inadequate  which  does  not  take  account  of  all 
these  phenomena.     Even  if  we  confine  our  attention  to  the 
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phenomena  distinctively  thermodynamic,  we  do  not  escape 
difficulties  in  as  simple  a  matter  as  the  number  of  degrees 
of  freedom  of  a  diatomic  gas.  It  is  well  known  that  while 
theory  would  assign  to  the  gas  six  degrees  of  freedom  per 
molecule,  in  our  experiments  on  specific  heat  we  cannot  ac- 
count for  more  than  five.  Certainly,  one  is  building  on  an 
insecure  foundation,  who  rests  his  work  on  hypotheses  con- 
cerning the  constitution  of  matter. 

Difficulties  of  this  kind  have  deterred  the  author  from  at- 
tempting to  explain  the  mysteries  of  nature,  and  have  forced 
him  to  be  contented  with  the  more  modest  aim  of  deducing 
some  of  the  more  obvious  propositions  relating  to  the  statis- 
tical branch  of  mechanics.  Here,  there  can  be  no  mistake  in 
regard  to  the  agreement  of  the  hypotheses  with  the  facts  of 
nature,  for  nothing  is  assumed  in  that  respect.  The  only 
error  into  which  one  can  fall,  is  the  want  of  agreement  be- 
tween the  premises  and  the  conclusions,  and  this,  with  care, 
one  may  hope,  in  the  main,  to  avoid. 

The  matter  of  the  present  volume  consists  in  large  measure 
of  results  which  have  been  obtained  by  the  investigators 
mentioned  above,  although  the  point  of  view  and  the  arrange- 
ment may  be  different.  These  results,  given  to  the  public 
one  by  one  in  the  order  of  their  discovery,  have  necessarily, 
in  their  original  presentation,  not  been  arranged  in  the  most 
logical  manner. 

In  the  first  chapter  we  consider  the  general  problem  which 
has  been  mentioned,  and  find  what  may  be  called  the  funda- 
mental equation  of  statistical  mechanics.  A  particular  case 
of  this  equation  will  give  the  condition  of  statistical  equi- 
librium, t.  6.,  the  condition  which  the  distribution  of  the 
systems  in  phase  must  satisfy  in  order  that  the  distribution 
shall  be  permanent.  In  the  general  case,  the  fundamental 
equation  admits  an  integration,  which  gives  a  principle  which 
may  be  variously  expressed,  according  to  the  point  of  view 
from  which  it  is  regarded,  as  the  conservation  of  density-in- 
phase,  or  of  extension-in-phase,  or  of  probability  of  phase. 
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In  the  second  chapter,  we  apply  this  principle  of  conserva- 
tion of  probability  of  phase  to  the  theory  of  errors  in  the 
calculated  phases  of  a  system,  when  the  determination  of  the 
arbitrary  constants  of  the  integral  equations  are  subject  to 
error.  In  this  application,  we  do  not  go  beyond  the  usual 
approximations.  In  other  words,  we  combine  the  principle 
of  conservation  of  probability  of  phase,  which  is  exact,  with 
those  approximate  relations,  which  it  is  customary  to  assume 
in  the  "  theory  of  errors." 

In  the  third  chapter  we  apply  the  principle  of  conservation 
of  extension-in-phase  to  the  integration  of  the  differential 
equations  of  motion.  This  gives  Jacobi's  "last  multiplier,9' 
as  has  been  shown  by  Boltzmann. 

In  the  fourth  and  following  chapters  we  return  to  the  con- 
sideration of  statistical  equilibrium,  and  confine  our  attention 
to  conservative  systems.  We  consider  especially  ensembles 
of  systems  in  which  the  index  (or  logarithm)  of  probability  of 
phase  is  a  linear  function  of  the  energy.  This  distribution, 
on  account  of  its  unique  importance  in  the  theory  of  statisti- 
cal equilibrium,  I  have  ventured  to  call  canonical,  and  the 
divisor  of  the  energy,  the  modulu*  of  distribution.  The 
moduli  of  ensembles  have  properties  analogous  to  temperature, 
in  that  equality  of  the  moduli  is  a  condition  of  equilibrium 
with  respect  to  exchange  of  energy,  when  such  exchange  is 
made  possible. 

We  find  a  differential  equation  relating  to  average  values 
in  the  ensemble  which  is  identical  in  form  with  the  funda- 
mental differential  equation  of  thermodynamics,  the  average 
index  of  probability  of  phase,  with  change  of  sign,  correspond- 
ing to  entropy,  and  the  modulus  to  temperature 

For  the  average  square  of  the  anomalies  of  the  energy,  we 
find  an  expression  whieh  vanishes  in  comparison  with  the 
square  of  the  average  energy,  when  the  number  of  degrees 
of  freedom  is  indefinitely  increased.  An  ensemble  of  systems 
in  which  the  number  of  degrees  of  freedom  is  of  the  same 
order  of  magnitude  as  the  number  of  molecules  in  the  bodies 
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with  which  we  experiment,  if  distributed  canonically,  would 
therefore  appear  to  human  observation  as  an  ensemble  of 
systems  in  which  all  have  the  same  energy. 

We  meet  with  other  quantities,  in  the  development  of  the 
subject,  which,  when  the  number  of  degrees  of  freedom  is 
very  great,  coincide  sensibly  with  the  modulus,  and  with  the 
average  index  of  probability,  taken  negatively,  in  a  canonical 
ensemble,  and  which,  therefore,  may  also  be  regarded  as  cor- 
responding to  temperature  and  entropy.  The  correspondence 
is  however  imperfect,  when  the  number  of  degrees  of  freedom 
is  not  very  great,  and  there  is  nothing  to  recommend  these 
quantities  except  that  in  definition  they  may  be  regarded  as 
more  simple  than  those  which  have  been  mentioned.  In 
Chapter  XIV,  this  subject  of  thermodynamic  analogies  is 
discussed  somewhat  at  length. 

Finally,  in  Chapter  XV,  we  consider  the  modification  of 
the  preceding  results  which  is  necessary  when  we  consider 
systems  composed  of  a  number  of  entirely  similar  particles, 
or,  it  may  be,  of  a  number  of  particles  of  several  kinds,  all  of 
each  kind  being  entirely  similar  to  each  other,  and  when  one 
of  the  variations  to  be  considered  is  that  of  the  numbers  of 
the  particles  of  the  various  kinds  which  are  contained  in  a 
system.  This  supposition  would  naturally  have  been  intro- 
duced earlier,  if  our  object  had  been  simply  the  expression  of 
the  laws  of  nature.  It  seemed  desirable,  however,  to  separate 
sharply  the  purely  thermodynamic  laws  from  those  special 
modifications  which  belong  rather  to  the  theory  of  the  prop- 
erties of  matter. 

J.  W.  G. 

New  Haven,  December,  1901. 
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ELEMENTARY   PRINCIPLES   IN 
STATISTICAL  MECHANICS 

CHAPTEB  I. 

GENERAL  NOTIONS.    THE  PRINCIPLE  OP  CONSERVATION 
OF  EXTENSION-IN-PHASE. 

We  shall  use  Hamilton's  form  of  the  equations  of  motion  for 
a  system  of  n  degrees  of  freedom,  writing  jlf  . .  .qn  for  the 
(generalized)  coordinates,  qi,  ...  qn  for  the  (generalized)  ve- 
locities, and 

F1dq1  +  Ftdqt...  +  Fndqn  (1) 

for  the  moment  of  the  forces.  We  shall  call  the  quantities 
Ft,...Fn  the  (generalized)  forces,  and  the  quantities px . . .pn, 
defined  by  the  equations 

/>!  =  -#,   i>«  =  -4,    etc,  (2) 

dqt  dqt 

where  ep  denotes  the  kinetic  energy  of  the  system,  the  (gen- 
eralized) momenta.  The  kinetic  energy  is  here  regarded  as 
a  function  of  the  velocities  and  coordinates.  We  shall  usually 
regard  it  as  a  function  of  the  momenta  and  coordinates,* 
and  on  this  account  we  denote  it  by  ep.  This  will  not  pre- 
vent us  from  occasionally  using  formulae  like  (2),  where  it  is. 
sufficiently  evident  the  kinetic  energy  is  regarded  as  function 
of  the  j's  and  j's.  But  in  expressions  like  dep/dq1 ,  where  the 
denominator  does  not  determine  the  question,  the  kinetic 

*  The  use  of  the  momenta  instead  of  the  velocities  as  independent  variables 
is  the  characteristic  of  Hamilton's  method  which  gives  his  equations  of  motion 
their  remarkable  degree  of  simplicity.  We  shall  find  that  the  fundamental 
notions  of  statistical  mechanics  are  most  easily  defined,  and  are  expressed  in 
the  most  simple  form,  when  the  momenta  with  the  coordinates  are  used  to 
describe  the  state  of  a  system. 
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4  HAMILTON'S  EQUATIONS. 

energy  is  always  to  be  treated  in  the  differentiation  as  function 
of  the  p'a  and  j's. 
We  have  then 

*-&  *— S+j,» etc-       (3) 

These  equations  will  hold  for  any  forces  whatever,  If  the 
forces  are  conservative,  in  other  words,  if  the  expression  (1) 
is  an  exact  differential,  we  may  set 

*— &  *•— &  eto-  <4> 

where  etf  is  a  function  of  the  coordinates  which  we  shall  call 
the  potential  energy  of  the  system.  If  we  write  e  for  the 
total  energy,  we  shall  have 

€=€,  +  «„  (6) 

and  equations  (8)  may  be  written 

*=^'  »— ;c»  etc-  (6) 

The  potential  energy  (eff)  may  depend  on  other  variables 
beside  the  coordinates  qt . . .  $v  We  shall  often  suppose  it  to 
depend  in  part  on  coordinates  of  external  bodies,  which  we 
shall  denote  by  at,  e^,  etc.  We  shall  then  have  for  the  com- 
plete value  of  the  differential  of  the  potential  energy  * 

«fef  =  —  Fx  dq1 .  .  —  Fn  dqn  —  Ax  da*  —  A%  da*  —  etc.,         (7) 

where  A19  -4a,  etc.,  represent  forces  (in  the  generalized  sense) 
exerted  by  the  system  on  external  bodies.  For  the  total  energy 
(c)  we  shall  have 

<fe=  Vidpi  •  •  •  +  £»4>»  —  P\dqx . .  • 

— P%  dqn  —  -4j  <&!  —  A%  daf  —  etc.  (8) 

It  will  be  observed  that  the  kinetic  energy  (e„)  in  the 
most  general  case  is  a  quadratic  function  of  the  p's  (or  y's) 

•  It  will  be  observed,  that  although  we  call  #  the  potential  energy  of  the 
system  which  we  are  considering,  it  is  really  so  defined  as  to  include  that 
energy  which  might  be  described  as  mutual  to  that  system  and  external 
bodies. 
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involving  also  the  qfs  but  not  the  af& ;  that  the  potential  energy, 
when  it  exists,  is  function  of  the  ;'s  and  a's;  and  that  the 
total  energy,  when  it  exists,  is  function  of  the  p's  (or  j's),  the 
q%  and  the  a's.  In  expressions  like  dejdq^  them's,  and  not 
the  q%  are  to  be  taken  as  independent  variables,  as  has  already 
been  stated  with  respect  to  the  kinetic  energy. 

Let  us  imagine  a  great  number  of  independent  systems, 
identical  in  nature,  but  differing  in  phase,  that  is,  in  their 
condition  with  respect  to  configuration  and  velocity.  The 
forces  are  supposed  to  be  determined  for  every  system  by  the 
same  law,  being  functions  of  the  coordinates  of  the  system 
?i>  •  •  •  ?»>  either  alone  or  with  the  coordinates  alf  as,  etc.  of 
certain  external  bodies.  It  is  not  necessary  that  they  should 
be  derivable  from  a  force-function.  The  external  coordinates 
Oj,  Og,  etc.  may  vary  with  the  time,  but  at  any  given  time 
have  fixed  values.  In  this  they  differ  from  the  internal 
coordinates  jlf . . .  qn,  which  at  the  same  time  have  different 
values  in  the  different  systems  considered. 

Let  us  especially  consider  the  number  of  systems  which  at  a 
given  instant  fall  within  specified  limits  of  phase,  viz.,  those 
for  which 

jh'<Pi<Pi",  qi<qx<q"r 

Pl<pt<pt\ 


q{  <q%<  qi>  \ 
fc'<fc<j.V 


(9) 

p:<Pn<p", 

the  accented  letters  denoting  constants.  We  shall  suppose 
the  differences  p"  —  pt\  q"  —  qt\  etc.  to  be  infinitesimal,  and 
that  the  systems  are  distributed  in  phase  in  some  continuous 
manner,*  so  that  the  number  having  phases  within  the  limits 
specified  may  be  represented  by 

»W  -  aO  •  •  •  ti>»"  -p.O  to"  -  j»0 . . .  <s*"  -  j.0.    (io) 

*  In  strictness,  a  finite  number  of  systems  cannot  be  distributed  contin- 
uously in  phase.  But  by  increasing  indefinitely  the  number  of  systems,  we 
may  approximate  to  a  continuous  law  of  distribution,  such  as  is  here 
described.  To  avoid  tedious  circumlocution,  language  like  the  above  may 
be  allowed,  although  wanting  in  precision  of  expression,  when  the  sense  in 
which  it  is  to  be  taken  appears  sufficiently  clear. 
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or  more  briefly  by 

Ddfr  ...dpndqt...  dqn,  (11) 

where  D  is  a  function  of  the  p*&  and  y's  and  in  general  of  t  also, 
for  as  time  goes  on,  and  the  individual  systems  change  their 
phases,  the  distribution  of  the  ensemble  in  phase  will  in  gen- 
eral vary.  In  special  cases,  the  distribution  in  phase  will 
remain  unchanged.  These  are  cases  of  statistical  equilibrium. 
If  we  regard  all  possible  phases  as  forming  a  sort  of  exten- 
sion of  2  n  dimensions,  we  may  regard  the  product  of  differ- 
entials in  (11)  as  expressing  an  element  of  this  extension,  and 
D  as  expressing  the  density  of  the  systems  in  that  element. 
We  shall  call  the  product 

dpi  . .  •  dpn  dqi  • .  •  dqn  (12) 

an  element  of  exUrmorUn-phase,  and  D  the  denrity+n-phase 
of  the  systems. 

It  is  evident  that  the  changes  which  take  place  in  the  den- 
sity of  the  systems  in  any  given  element  of  extension-in- 
phase  will  depend  on  the  dynamical  nature  of  the  systems 
and  their  distribution  in  phase  at  the  time  considered. 

In  the  case  of  conservative  systems,  with  which  we  shall  be  . 
principally  concerned,  their  dynamical  nature  is  completely 
determined  by  the  function  which  expresses  the  energy  (e)  in 
terms  of  the  p%  q%  and  a's  (a  function  supposed  identical 
for  all  the  systems) ;  in  the  more  general  case  which  we  are 
considering,  the  dynamical  nature  of  the  systems  is  deter- 
mined by  the  functions  which  express  the  kinetic  energy  (ep) 
in  terms  of  the  pyB  and  q%  and  the  forces  in  terms  of  the 
qys  and  a's.  The  distribution  in  phase  is  expressed  for  the 
time  considered  by  D  as  function  of  the  p's  and  j's.  To  find 
the  value  of  dD/dt  for  the  specified  element  of  extension-in- 
phase,  we  observe  that  the  number  of  systems  within  the 
limits  can  only  be  varied  by  systems  passing  the  limits,  which 
may  take  place  in  4  n  different  ways,  viz.,  by  the  px  of  a  sys- 
tem passing  the  limit  pt\  or  the  limit  p{\  or  by  the  qx  of  a 
system  passing  the  limit  qx\  or  the  limit  }/',  etc.  Let  us 
consider  these  cases  separately. 
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In  the  first  place,  let  us  consider  the  number  of  systems 
which  in  the  time  dt  pass  into  or  out  of  the  specified  element 
by  px  passing  the  limit  px.  It  will  be  convenient,  and  it  is 
evidently  allowable,  to  suppose  dt  so  small  that  the  quantities 
px  dt,  qx  dt,  etc.,  which  represent  the  increments  of  px,  qx,  etc., 
in  the  time  dt  shall  be  infinitely  small  in  comparison  with 
the  infinitesimal  differences  px  —  px,  qx  —  qx\  etc.,  which  de- 
termine the  magnitude  of  the  element  of  extension-in-phase. 
The  systems  for  which  px  passes  the  limit  px  in  the  interval 
dt  are  those  for  which  at  the  commencement  of  this  interval 
the  value  of  px  lies  between  px  and  px  —  px  dt,  as  is  evident 
if  we  consider  separately  the  cases  in  which  px  is  positive  and 
negative.  Those  systems  for  which  px  lies  between  these 
limits,  and  the  other  p*&  and  qa  between  the  limits  specified  in 
(9),  will  therefore  pass  into  or  out  of  the  element  considered 
according  as  p  is  positive  or  negative,  unless  indeed  they  also 
pass  some  other  limit  specified  in  (9)  during  the  same  inter- 
val of  time.  But  the  number  which  pass  any  two  of  these 
limits  will  be  represented  by  an  expression  containing  the 
square  of  it  as  a  factor,  and  is  evidently  negligible,  when  dt 
is  sufficiently  small,  compared  with  the  number  which  we  are 
seeking  to  evaluate,  and  which  (with  neglect  of  terms  contain- 
ing dfi)  may  be  found  by  substituting  px  dt  for  p{'  —  px  in 
(10)  or  for  dpx  in  (11). 

The  expression 

Dpx  dt  dpt .  .  .  dp*  dqx  .  .  .  dqn  (13) 

will  therefore  represent,  according  as  it  is  positive  or  negative, 
the  increase  or  decrease  of  the  number  of  systems  within  the 
given  limits  which  is  due  to  systems  passing  the  limit  px.  A 
similar  expression,  in  which  however  D  and  p  will  have 
slightly  different  values  (being  determined  for  p"  instead  of 
p/),  will  represent  the  decrease  or  increase  of  the  number  of 
systems  due  to  the  passing  of  the  limit  pt".  The  difference 
of  the  two  expressions,  or 

%%&dp1...dpndql...dqndt  (U) 

dpx 
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will  represent  algebraically  the  decrease  of  the  number  of 
systems  within  the  limits  due  to  systems  passing  the  limits  p{ 
and  pt". 

The  decrease  in  the  number  of  systems  within  the  limits 
due  to  systems  passing  the  limits  q{  and  q"  may  be  found  in 
the  same  way.    This  will  give 

(^+^)*..-**..-*.*     <15> 

for  the  decrease  due  to  passing  the  four  limits  p{%  p{'*  q{%  qx". 
But  since  the  equations  of  motion  (3)  give 

the  expression  reduces  to 

(jZP*  +  gr  ft)  #>  •  •  •  *^i  •  •  •  dqndt.  (17) 

If  we  prefix  2  to  denote  summation  relative  to  the  suffixes 
1 . . .  n,  we  get  the  total  decrease  in  the  number  of  systems 
within  the  limits  in  the  time  dt.    That  is, 

2  \dpVx  +  da  *x)  dpi ' '  *  dPn  ^  '  ' '  dq%  dt  = 

—  dDdp1...dpndql...  dqn}     (18) 

wM-"a^+*ft>  (19) 

where  the  suffix  applied  to  the  differential  coefficient  indicates 
that  the  j?'s  and  j's  are  to  be  regarded  as  constant  in  the  differ- 
entiation.   The  condition  of  statistical  equilibrium  is  therefore 

If  at  any  instant  this  condition  is  fulfilled  for  all  values  of  the 
p's  and  q%  (dD/dt)p,q  vanishes,  and  therefore  the  condition 
will  continue  to  hold,  and  the  distribution  in  phase  will  be 
permanent,  so  long  as  the  external  coordinates  remain  constant. 
But  the  statistical  equilibrium  would  in  general  be  disturbed 
by  a  change  in  the  values  of  the  external  coordinates}  which 


or 
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would  alter  the  values  of  the  pyB  as  determined  by  equations 
(8),  and  thus  disturb  the  relation  expressed  in  the  last  equation. 
If  we  write  equation  (19)  in  the  form 


SL*+»£**+S**M      « 


it  will  be  seen  to  express  a  theorem  of  remarkable  simplicity. 
Since  2>  is  a  function  of  t,  pt,  .  •  •  pu,  qt ,  •  •  •  q„  its  complete 
differential  will  consist  of  parts  due  to  the  -variations  of  all 
these  quantities.  Now  the  first  term  of  the  equation  repre- 
sents the  increment  of  D  due  to  an  increment  of  t  (with  con- 
stant values  of  them's  and  ;'s),  and  the  rest  of  the  first  member 
represents  the  increments  of  J>  due  to  increments  of  the  py& 
and  ;'s,  expressed  by  px  dt,  qt  dt,  etc  But  these  are  precisely 
the  increments  which  the  p*B  and  g's  receive  in  the  movement 
of  a  system  in  the  time  dt  The  whole  expression  represents 
the  total  increment  of  2>  for  the  varying  phase  of  a  moving 
system.    We  have  therefore  the  theorem :  — 

In  an  ensemble  of  mechanical  systems  identical  in  nature  and 
subject  to  force*  determined  by  identical  laws,  but  distributed 
in  phase  in  any  continuous  manner,  the  density-inphase  is 
constant  in  time  for  the  varying  phases  of  a  moving  system; 
provided,  that  the  forces  of  a  system  are  functions  of  its  co- 
ordinates, either  alone  or  with  the  time.* 

This  may  be  called  the  principle  of  conservation  qf  density- 
in-phase.    It  may  also  be  written 

where  a, ...  h  represent  the  arbitrary  constants  of  the  integral 
equations  of  motion,  and  are  suffixed  to  the  differential  co 

*  The  condition  that  the  forces  Flt . . .  Fn  are  functions  of  ft , . . .  qn  and 
altatf  etc.,  which  last  are  functions  of  the  time,  is  analytically  equiralent 
to  the  condition  that  Ft.  ...Fn  are  functions  of  ft,  ...?■  and  the  time. 
Explicit  mention  of  the  external  coordinates,  a1>atP  etc.,  has  been  made  in 
the  preceding  pages,  because  our  purpose  will  require  us  hereafter  to  con- 
sider these  coordinates  and  the  connected  forces,  Av  A*  etc.,  which  repre- 
sent the  action  of  the  systems  on  external  bodies. 
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efficient  to  indicate  that  they  are  to  be  regarded  as  constant 
in  the  differentiation. 

We  may  give  to  this  principle  a  slightly  different  expres- 
sion.   Let  us  call  the  value  of  the  integral 


/.../*• 


dpndqt...dqn  (23) 


taken  within  any  limits  the  extermon-in-phase  within  those 
limits. 

When  the  phases  bounding  an  eztension-tn-phase  vary  in 
the  course  of  time  according  to  the  dynamical  laws  of  a  system 
subject  to  forces  which  are  functions  of  the  coordinates  either 
alone  or  with  the  time,  the  value  of  the  extension-in-phase  thus 
bounded  remains  constant.  In  this  form  the  principle  may  be 
called  the  principle  of  conservation  of  eztensioririn-phase.  In 
some  respects  this  may  be  regarded  as  the  most  simple  state- 
ment of  the  principle,  since  it  contains  no  explicit  reference 
to  an  ensemble  of  systems. 

Since  any  extension-in-phase  may  be  divided  into  infinitesi- 
mal portions,  it  is  only  necessary  to  prove  the  principle  for 
an  infinitely  small  extension.  The  number  of  systems  of  an 
ensemble  which  fall  within  the  extension  will  be  represented 
by  the  integral 

I  ...  I  D  dpi  .  •  •  dpn  dqt .  .  .  dq^ 

If  the  extension  is  infinitely  small,  we  may  regard  D  as  con- 
stant in  the  extension  and  write 


5  J  .  .  .  /  dpx  .  .  .  dpn  dqt  .  .  .  dqn 


for  the  number  of  systems.  The  value  of  this  expression  must 
be  constant  in  time,  since  no  systems  are  supposed  to  be 
created  or  destroyed,  and  none  can  pass  the  limits,  because 
the  motion  of  the  limits  is  identical  with  that  of  the  systems. 
But  we  have  seen  that  D  is  constant  in  time,  and  therefore 
the  integral 

/  . .  .  J  dpt .  . .  dpndqx  .  . .  dqn 


tnt 
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which  we  have  called  the  extenrion-in-phase,  is  also  constant 
in  time.* 

Since  the  system  of  coordinates  employed  in  the  foregoing 
discussion  is  entirely  arbitrary,  the  values  of  the  coordinates 
relating  to  any  configuration  and  its  immediate  vicinity  do 
not  impose  any  restriction  upon  the  values  relating  to  other 
configurations.  The  fact  that  the  quantity  which  we  have 
called  density-in-phase  is  constant  in  time  for  any  given  sys- 
tem, implies  therefore  that  its  value  is  independent  of  the 
coordinates  which  are  used  in  its  evaluation.  For  let  the 
density-in-phase  as  evaluated  for  the  same  time  and  phase  by 
one  system  of  coordinates  be  2V*  and  by  another  system  2>s'. 
A  system  which  at  that  time  has  that  phase  will  at  another 
time  have  another  phase.  Let  the  density  as  calculated  for 
this  second  time  and  phase  by  a  third  system  of  coordinates 
be  2>8".  Now  we  may  imagine  a  system  of  coordinates  which 
at  and  near  the  first  configuration  will  coincide  with  the  first 
system  of  coordinates,  and  at  and  near  the  second  configuration 
will  coincide  with  the  third  system  of  coordinates.  This  will 
give  Dxf  =  D8".  Again  we  may  imagine  a  system  of  coordi- 
nates which  at  and  near  the  first  configuration  will  coincide 
with  the  second  system  of  coordinates,  and  at  and  near  the 

*  If  we  regard  a  phase  as  represented  by  a  point  in  space  of  2  a  dimen- 
sions, the  changes  which  take  place  in  the  course  of  time  in  our  ensemble  of 
systems  will  be  represented  by  a  current  in  such  space.  This  current  will 
be  steady  so  long  as  the  external  coordinates  are  not  raided.  In  any  case 
the  current  will  satisfy  a  law  which  in  its  various  expressions  is  analogous 
to  the  hydrodynamic  law  which  may  be  expressed  by  the  phrases  conserva- 
tion of  volume*  or  conservation  of  density  about  a  moving  point,  or  by  the  equation 

dx      dv      dz      A 
dx^  dy^dz     v 

The  analogue  in  statistical  mechanics  of  this  equation,  vis., 

&*£*$+£*--■ 

may  be  derived  directly  from  equations  (3)  or  (6),  and  may  suggest  such 
theorems  as  have  been  enunciated,  if  indeed  it  is  not  regarded  as  making 
them  intuitively  evident  The  somewhat  lengthy  demonstrations  given 
above  will  at  least  serve  to  give  precision  to  the  notions  involved,  and 
familiarity  with  their  use. 
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second  configuration  will  coincide  with  the  third  system  of 
coordinates.    This  will  give  D%'  =  2>8".    We  have  therefore 

zv  =  iy. 

It  follows,  or  it  may  be  proved  in  the  same  way,  that  the 
value  of  an  extension-in-phase  is  independent  of  the  system 
of  coordinates  which  is  used  in  its  evaluation.  This  may 
easily  be  verified  directly.  If  ft, . . .  qu,  Ql9 . . .  Qu  are  two 
systems  of  coordinates,  and  pv . .  .p„  Plf . . .  Pu  the  cor- 
responding momenta,  we  have  to  prove  that 

J...Jdp1...dp%dqx...dq%=f...fdP1...dP%dQ1...dQn,(24) 

when  the  multiple  integrals  are  taken  within  limits  consisting 
of  the  same  phases.  And  this  will  be  evident  from  the  prin- 
ciple on  which  we  change  the  variables  in  a  multiple  integral, 
if  we  prove  that 

d(Pu  . . .  Pn,  Qu  •  •  •  ft)  =  -  ^okx 

d(Pi>  --Pn,qu  .••?»)         ' 
where  the  first  member  of  the  equation  represents  a  Jacobian 
or  functional  determinant.    Since  all  its  elements  of  the  form 
dQ/dp  are  equal  to  zero,  the  determinant  reduces  to  a  product 
of  two,  and  we  have  to  prove  that 

d(P1,...PJd(QlJ...Qn)^1 
d(pi,..  .pn)    d(qu...q^ 

We  may  transform  any  element  of  the  first  of  these  deter- 
minants as  follows.  By  equations  (2)  and  (8),  and  in 
view  of  the  fact  that  the  #'s  are  linear  functions  of  the  q*a 
and  therefore  of  the  ^'s,  with  coefficients  involving  the  y's, 
so  that  a  differential  coefficient  of  the  form  dQr/dp9  is  function 
of  the  j's  alone,  we  get* 
*  The  form  of  the  equation 

—  ^fe.—   ^    <f*p 

dp*  d$*      dClm  dp, 
in  (27)  reminds  us  of  the  fundamental  identity  in  the  differential  calculut 
relating  to  the  order  of  differentiation  with  respect  to  independent  variables. 
But  it  will  be  observed  that  here  the  variables  Q*  and  p9  are  not  independent 
and  that  the  proof  depends  on  the  linear  relation  between  the  (jps  and  the  p's. 
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_d_  dc, 


dp,d$m     r*x\d4rdQmdp,) 


<t<L'=»u&4fr/    <*&«>,    d<3.  '    l  ' 

But  since  *KS(%^)' 


(28) 
(29) 

(30) 


Therefore, 

d(Plf . . .  P.)  _  <*(gly . . .  y.)  _  rffo, . . .  qj 

<Kpi,  •  •  /v)    «*(&, ...  do    <*(&, ...  00" 

The  equation  to  be  proved  is  thus  reduced  to 
d(qi,...qj  *(<&,...  QJ 

which  is  easily  proved  by  the  ordinary  rule  for  the  multiplica- 
tion of  determinants. 

The  numerical  value  of  an  extension-in-phase  will  however 
depend  on  the  units  in  which  we  measure  energy  and  time. 
For  a  product  of  the  form  dp  dq  has  the  dimensions  of  energy 
multiplied  by  time,  as  appears  from  equation  (2),  by  which 
the  momenta  are  defined.  Hence  an  extension-in-phase  has 
the  dimensions  of  the  nth  power  of  the  product  of  energy 
and  time.  In  other  words,  it  has  the  dimensions  of  the  nth 
power  of  action,  as  the  term  is  used  in  the  *  principle  of  Least 
Action.9 

If  we  distinguish  by  accents  the  values  of  the  momenta 
and  coordinates  which  belong  to  a  time  H ,  the  unaccented 
letters  relating  to  the  time  t,  the  principle  of  the  conserva- 
tion of  extension-in-phase  may  be  written 

J  ...  /  dpl...dpndql...dqm=:j  ...J  dpj ...dpjdqt' ...dqn',  (81) 

or  more  briefly 

J  •  •  -Jdpx.. .  dqn=J. .  .Jdpt' . . .  dqj,  (82) 
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the  limiting  phases  being  those  which  belong  to  the  same 
systems  at  the  times  t  and  tf  respectively.  But  we  have 
identically 

for  such  limits.  The  principle  of  conservation  of  extension-in- 
phase  may  therefore  be  expressed  in  the  form 

dip;...*.*)-1'  (33) 

This  equation  is  easily  proved  directly.  For  we  have 
identically 

d(pu  • « *  ft)       <?(Pi>  -  *  ft)  <g(Pi">  • ;  -  ft") 
<*(M  •  •  •  ft)     <*Cft",  •  •  •  ft")  <*(/>i'>  •  •  •  ftO  ' 
where  the  double  accents  distinguish  the  values  of  the  momenta 
and  coordinates  for  a  time  tf'.    If  we  vary  t,  while  H  and  t" 
remain  constant,  we  have 

d  d(plf ...  ft)  _  d(  pj\ . .  .  ft")  d_  d(pu...qn) 
dtd(Pl>,...qJ)-   tf(iV,...ft')    dtd(Pl»,...qn»y    V> 

Now  since-  the  time  t'  is  entirely  arbitrary,  nothing  prevents 
us  from  making  f  identical  with  t  at  the  moment  considered. 
Then  the  determinant 

d(pi , . . .  ft) 

d(M..ft") 
will  have  unity  for  each  of  the  elements  on  the  principal 
diagonal,  and  zero  for  all  the  other  elements.  Since  every 
term  of  the  determinant  except  the  product  of  the  elements 
on  the  principal  diagonal  will  have  two  zero  factors,  the  differen- 
tial of  the  determinant  will  reduce  to  that  of  the  product  of 
these  elements,  %.  e.,  to  the  sum  of  the  differentials  of  these 
elements.     This  gives  the  equation 

J*    **(ft  >  "  •  ft)   __  dpi  dpn        dqx  dqn 

dt  d{pj',  . . .  qj*)  ~  dPl»  *  '  '  *  dpj'  +  dqx»  •'•"*■  dqj': 

Now  since  t  =  *",  the  double  accents  in  the  second  member 
of  this  equation  may  evidently  be  neglected.  This  will  give, 
in  virtue  of  such  relations  as  (16), 
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d   d(pjf  ...g»)    _Q 

daW,...!.")-"* 
which  substituted  in  (84)  will  give 
d  d(pl9...qn) 

The  determinant  in  this  equation  is  therefore  a  constant,  the 
value  of  which  may  be  determined  at  the  instant  when  t  =  f, 
when  it  is  evidently  unity.  Equation  (88)  is  therefore 
demonstrated. 

Again,  if  we  write  a, ...  A  for  a  system  of  2  n  arbitrary  con- 
stants of  the  integral  equations  of  motion,  pv  qv  etc.  will  be 
functions  of  a, ...  A,  and  t9  and  we  may  express  an  extension- 
in-phase  in  the  form 

^-i^da...^.  (36) 

If  we  suppose  the  limits  specified  by  values  of  a, ...  A,  a 
system  initially  at  the  limits  will  remain  at  the  limits. 
The  principle  of  conservation  of  extension-in-phase  requires 
that  an  extension  thus  bounded  shall  have  a  constant  value. 
This  requires  that  the  determinant  under  the  integral  sign 
shall  be  constant,  which  may  be  written 

dt  d(a,...h)  -°'  <*' 

This  equation,  which  may  be  regarded  as  expressing  the  prin- 
ciple of  conservation  of  extension-in-phase,  may  be  derived 
directly  from  the  identity 

d(j>x, . . .  g»)  _  d(pu  ...qn)  d(ptf9  . . .  qj) 
d(a, ...  A)        d(ptfy . . .  qj)    d(a, ...  A) 

in  connection  with  equation  (83). 

Since  the  coordinates  and  momenta  are  functions  of  a, ...  A, 
and  £,  the  determinant  in  (86)  must  be  a  function  of  the  same 
variables,  and  since  it  does  not  vary  with  the  time,  it  must 
be  a  function  of  a, ...  A  alone.    We  have  therefore 

Sjg^- *»(*...»).  (37) 


Digitized  by  VjOOQIC 


16  CONSERVATION  OF 

It  is  the  relative  numbers  of  systems  which  fall  within  dif- 
ferent limits,  rather  than  the  absolute  numbers,  with  which  we 
are  most  concerned.  It  is  indeed  only  with  regard  to  relative 
numbers  that  such  discussions  as  the  preceding  will  apply 
with  literal  precision,  since  the  nature  of  our  reasoning  implies 
that  the  number  of  systems  in  the  smallest  element  of  space 
which  we  consider  is  very  great.  This  is  evidently  inconsist- 
ent with  a  finite  value  of  the  total  number  of  systems,  or  of 
the  density-in-phase.  Now  if  the  value  of  D  is  infinite,  we 
cannot  speak  of  any  definite  number  of  systems  within  any 
finite  limits,  since  all  such  numbers  are  infinite.  But  the 
ratios  of  these  infinite  numbers  may  be  perfectly  definite.  If 
we  write  JVfor  the  total  number  of  systems,  and  set 

P  =  £,  (38) 

P  may  remain  finite,  when  N  and  D  become  infinite.  The 
integral 

f...fpdp1...dqn  (39) 

taken  within  any  given  limits,  will  evidently  express  the  ratio 
of  the  number  of  systems  falling  within  those  limits  to  the 
whole  number  of  systems.  This  is  the  same  thing  as  the 
probability  that  an  unspecified  system  of  the  ensemble  (i.  e. 
one  of  which  we  only  know  that  it  belongs  to  the  ensemble) 
will  lie  within  the  given  limits.    The  product 

PdPl...dq%  (40) 

expresses  the  probability  that  an  unspecified  system  of  the 
ensemble  will  be  found  in  the  element  of  extension-in-phase 
dpi ...  dqn.  We  shall  call  P  the  coefficient  of  probability  of  the 
phase  considered.  Its  natural  logarithm  we  shall  call  the 
index  of  probability  of  the  phase,  and  denote  it  by  the  letter  q. 
If  we  substitute  NP  and  Ne*  for  2)  in  equation  (19),  we  get 
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The  condition  of  statistical  equilibrium  may  be  expressed 
by  equating  to  zero  the  second  member  of  either  of  these 
equations. 
The  same  substitutions  in  (22)  give 

($._-*         <»> 


and 


That  is,  the  values  of  P  and  q,  like  those  of  D,  are  constant 
in  time  for  moving  systems  of  the  ensemble.  From  this  point 
of  view,  the  principle  which  otherwise  regarded  has  been 
called  the  principle  of  conservation  of  density-in-phase  or 
conservation  of  extension-in-phase,  may  be  called  the  prin- 
ciple of  conservation  of  the  coefficient  (or  index)  of  proba- 
bility of  a  phase  varying  according  to  dynamical  laws,  or 
more  briefly,  the  principle  of  conservation  of  probability  of 
phase.  It  is  subject  to  the  limitation  that  the  forces  must  be 
functions  of  the  coordinates  of  the  system  either  alone  or  with 
the  time. 

The  application  of  this  principle  is  not  limited  to  cases  in 
which  there  is  a  formal  and  explicit  reference  to  an  ensemble  of 
systems.  Yet  the  conception  of  such  an  ensemble  may  serve 
to  give  precision  to  notions  of  probability.  It  is  in  fact  cus- 
tomary in  the  discussion  of  probabilities  to  describe  anything 
which  is  imperfectly  known  as  something  taken  at  random 
from  a  great  number  of  things  which  are  completely  described. 
But  if  we  prefer  to  avoid  any  reference  to  an  ensemble 
of  systems,  we  may  observe  that  the  probability  that  the 
phase  of  a  system  falls  within  certain  limits  at  a  certain  time, 
is  equal  to  the  probability  that  at  some  other  time  the  phase 
will  fall  within  the  limits  formed  by  phases  corresponding  to 
the  first.  For  either  occurrence  necessitates  the  other.  That 
is,  if  we  write  P*  for  the  coefficient  of  probability  of  the 
phase  pij . . .  qn'  at  the  time  tf ,  and  P"  for  that  of  the  phase 
Pi"*  •  •  •  in    at  the  time  tf ', 
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j.  .  .fFdql  .  . .  dq:  =f.  .  .fp"dPl".  . .  dqn",      (45) 

where  the  limits  in  the  two  cases  are  formed  by  corresponding 
phases.  When  the  integrations  cover  infinitely  small  vari- 
ations of  the  momenta  and  coordinates,  we  may  regard  P*  and 
Pn  as  constant  in  the  integrations  and  write 

P'f...fdp1<...dqJ'  =  P>f...fdpl»...dqn». 

Now  the  principle  of  the  conservation  of  extension-in-phase, 
which  has  been  proved  (viz.,  in^the  second  demonstration  given 
above)  independently  of  any  reference  to  an  ensemble  of 
systems,  requires  that  the  values  of  the  multiple  integrals  in 
tins  equation  shall  be  equal    This  gives 

P"  =  P*. 

With  reference  to  an  important  class  of  cases  this  principle 
may  be  enunciated  as  follows. 

When  the  differential  equations  of  motion  are  exactly  known, 
but  the  constants  of  the  integral  equations  imperfectly  deter- 
mined,  the  coefficient  of  probability  of  any  phase  at  any  time  is 
equal  to  the  coefficient  of  probability  of  the  corresponding  phase 
at  any  other  time.  By  corresponding  phases  are  meant  those 
which  are  calculated  for  different  times  from  the  same  values 
of  the  arbitrary  constants  of  the  integral  equations. 

Since  the  sum  of  the  probabilities  of  all  possible  cases  is 
necessarily  unity,  it  is  evident  that  we  must  have 
an 
J...fpdpl...dqn  =  l,  (46) 


where  the  integration  extends  over  all  phases.    This  is  indeed 
only  a  different  form  of  the  equation 

an 
N=z  J  .. .  I  Ddp!...dqn, 


which  we  may  regard  as  defining  N. 
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The  values  of  the  coefficient  and  index  of  probability  of 
phase,  like  that  of  the  density-in-phase,  are  independent  of  the 
system  of  coordinates  which  is  employed  to  express  the  distri- 
bution in  phase  of  a  given  ensemble. 

In  dimensions,  the  coefficient  of  probability  is  the  reciprocal 
of  an  extension-in-phase,  that  is,  the  reciprocal  of  the  nth 
power  of  the  product  of  time  and  energy.  The  index  of  prob- 
ability is  therefore  affected  by  an  additive  constant  when  we 
change  our  units  of  time  and  energy.  If  the  unit  of  time  is 
multiplied  by  ct  and  the  unit  of  energy  is  multiplied  by  c€ ,  all 
indices  of  probability  relating  to  systems  of  n  degrees  of 
freedom  will  be  increased  by  the  addition  of 

n  log  et  +  n  log  c„  (47) 
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CHAPTER  n. 

APPLICATION  OP  THE  PRINCIPLE  OF  CONSERVATION 

OF  EXTENSION-Itf-PHASE  TO  THE  THEORY 

OF  ERRORS. 

Let  us  now  proceed  to  combine  the  principle  which  has  been 
demonstrated  in  the  preceding  chapter  and  which  in  its  differ- 
ent applications  and  regarded  from  different  points  of  view 
has  been  variously  designated  as  the  conservation  of  density- 
in-phase,  or  of  extension-in-phase,  or  of  probability  of  phase, 
with  those  approximate  relations  which  are  generally  used  in- 
the  *  theory  of  errors/ 

We  suppose  that  the  differential  equations  of  the  motion  of 
a  system  are  exactly  known,  but  that  the  constants  of  the 
integral  equations  are  only  approximately  determined.  It  is 
evident  that  the  probability  that  the  momenta  and  coordinates 
at  the  time  tr  fall  between  the  limits  pj  and  pxf  +  dp^  q±  and 
ii  +  d<liy  ete-»  may  be  expressed  by  the  formula 

tfipl'...dsJ9  (48) 

where  rf  (the  index  of  probability  for  the  phase  in  question)  is 
a  function  of  the  coordinates  and  momenta  and  of  the  time. 

Let  QJ,  P\i  etc.  be  the  values  of  the  coordinates  and  momenta 
which  give  the  maximum  value  to  i/>  and  let  the  general 
value  of  if  be  developed  by  Taylor's  theorem  according  to 
ascending  powers  and  products  of  the  differences  pj  —  P/f 
Qi  —  (?i'i  ete->  an(i  let  to  suppose  that  we  have  a  sufficient 
approximation  without  going  beyond  terms  of  the  second 
degree  in  these  differences.    We  may  therefore  set 

V  s  o  -  F',  (49) 

where  c  is  independent  of  the  differences  ptf  —  P/,  q{  —  Qt\ 
etc.,  and  F'  is  a  homogeneous  quadratic  function  of  these 
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differences.  The  terms  of  the  first  degree  vanish  in  virtue 
of  the  maximum  condition,  which  also  requires  that  F1  must 
have  a  positive  value  except  when  all  the  differences  men- 
tioned vanish.    If  we  set 

C=e«,  (60) 

we  may  write  for  the  probability  that  the  phase  lies  within 
the  limits  considered 

Of1*  dpx' . . .  dqj.  (51) 

C  is  evidently  the  maximum  value  of  the  coefficient  of  proba- 
bility at  the  time  considered. 

In  regard  to  the  degree  of  approximation  represented  by 
these  formulae,  it  is  to  be  observed  that  we  suppose,  as  is 
usual  in  the  *  theory  of  errors,'  that  the  determination  (ex- 
plicit or  implicit)  of  the  constants  of  motion  is  of  such 
precision  that  the  coefficient  of  probability  e*  or  Ce"^  is 
practically  zero  except  for  very  small  values  of  the  differences 
p{  —  P/,  qxf  —  Qxr>  etc.  For  very  small  values  of  these 
differences  the  approximation  is  evidently  in  general  sufficient, 
for  larger  values  of  these  differences  the  value  of  Ce"^  will 
be  sensibly  zero,  as  it  should  be,  and  in  this  sense  the  formula 
will  represent  the  facts. 

We  shall  suppose  that  the  forces  to  which  the  system  is 
subject  are  functions  of  the  coordinates  either  alone  or  with 
the  time.  The  principle  of  conservation  of  probability  of 
phase  will  therefore  apply,  which  requires  that  at  any  other 
time  (I")  the  maximum  value  of  the  coefficient  of  probability 
shall  be  the  same  as  at  the  time  tf,  and  that  the  phase 
(Px",  Qx\  etc.)  which  has  this  greatest  probability-coefficient, 
shall  be  that  w^rich  corresponds  to  the  phase  (P/,  Qx\  etc.), 
t.  €.,  which  is  calculated  from  the  same  values  of  the  constants 
of  the  integral  equations  of  motion. 

We  may  therefore  write  for  the  probability  that  the  phase 
at  the  time  fl  falls  within  the  limits  pf  and  jpf  +  dpx*y  qx" 
and  qxu  +  dqx",  etc., 

C€r*"dPlH  ...dql*,  (52) 
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where  G  represents  the  same  value  as  in  the  preceding 
formula,  viz.,  the  constant  value  of  the  mftTimriMi  coefficient 
of  probability,  and  F"  is  a  quadratic  function  of  the  differences 
Pi  -  -Pi".  ?i"  -  Qi">  etc.,  the  phase  (P/',  QJ*  etc.)  being  that 
which  at  the  time  f  corresponds  to  the  phase  (Px\  Qt'9  etc.) 
at  the  time  if. 
Now  we  have  necessarily 

j. .  .Jce-rdPl'...dqn'=J. .  .fce-*"dp1»...dqn"  =  l,  (63) 

when  the  integration  is  extended  over  all  possible  phases. 
It  will  be  allowable  to  set  ±  oo  for  the  limits  of  all  the  coor- 
dinates and  momenta,  not  because  these  values  represent  the 
actual  limits  of  possible  phases,  but  because  the  portions  of 
the  integrals  lying  outside  of  the  limits  of  all  possible  phases 
will  have  sensibly  the  value  zero.  With  ±  oo  for  limits,  the 
equation  gives 

where/'  is  the  discriminant  *  of  F\  and/"  that  of  Frf.  This 
discriminant  is  therefore  constant  in  time,  and  like  C  an  abso- 
lute invariant  in  respect  to  the  system  of  coordinates  which 
may  be  employed.  In  dimensions,  like  C",  it  is  the  reciprocal 
of  the  2nth  power  of  the  product  of  energy  and  time. 

Let  us  see  precisely  how  the  functions  J"  and  F"  are  related. 
The  principle  of  the  conservation  of  the  probability-coefficient 
requires  that  any  values  of  the  coordinates  and  momenta  at  the 
time  f  shall  give  the  function  Ff  the  same  value  as  the  corre- 
sponding coordinates  and  momenta  at  the  time  t"  give  to  F". 
Therefore  F11  may  be  derived  from  F1  by  substituting  for 
Pi*  •••?»'  t^eir  values  in  terms  of  p^',  . . .  j/'.  Now  we 
have  approximately 

*  Thii  term  is  used  to  denote  the  determinant  having  for  elements  on  the 
principal  diagonal  the  coefficients  of  the  squares  in  the  quadratic  function 
F't  and  for  its  other  elements  the  halves  of  the  coefficients  of  the  products 
inF'. 
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?-'-«.'  =  |^0'i"--Pi")..-  +  ^  (?."-«."), 

and  as  in  F"  terms  of  higher  degree  than  the  second  are  to  be 
neglected,  these  equations  may  be  considered  accurate  for  the 
purpose  of  the  transformation  required.  Since  by  equation 
(33)  the  eliminant  of  these  equations 'has  the  value  unity, 
the  discriminant  of  F"  will  be  equal  to  that  of  F\  as  has 
already  appeared  from  the  consideration  of  the  principle  of 
conservation  of  probability  of  phase,  which  is,  in  fact,  essen- 
tially the  same  as  that  expressed  by  equation  (33). 
At  the  time  tf,  the  phases  satisfying  the  equation 

F'  =  k,  (66) 

where  k  is  any  positive  constant,  have  the  probability-coeffi- 
cient C  e~k.  At  the  time  £",  the  corresponding  phases  satisfy 
the  equation 

F"  =  k,  (57) 

and  have  the  same  probability-coefficient.  So  also  the  phases 
within  the  limits  given  by  one  or  the  other  of  these  equations 
are  corresponding  phases,  and  have  probability-coefficients 
greater  than  Ce~*,  while  phases  without  these  limits  have  less 
probability-coefficients.  The  probability  that  the  phase  at 
the  time  if  falls  within  the  limits  F'  =  h  is  the  same  ad  the 
probability  that  it  falls  within  the  limits  F,r  =  h  at  the  time  t\ 
since  either  event  necessitates  the  other.  This  probability 
may  be  evaluated  as  follows.  We  may  omit  the  accents,  as 
we  need  only  consider  a  single  time.  Let  us  denote  the  ex- 
tension-in-phase  within  the  limits  F  =  k  by  ?7,  and  the  prob- 
ability that  the  phase  falls  within  these  limits  by  R,  also  the 
extension-in-phase  within  the  limits  F  =  1  by  Uv  We  have 
then  by  definition 

F=k 

U=f.-.fdPl...dqn,  (58) 
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B  =  j.  . .  f  C  €TF  dpx . . .  dqn9  (59) 

#!  =  /  ...  /  dfr...  dqn.  (60) 

But  since  F  is  a  homogeneous  quadratic  function  of  the 
differences 

P\  —  -Pi>  P%  —  P%>  •  •  •  9%  —  Q»f 
we  have  identically 
*=* 
f...fd(Pl-Pl)...d(!i.-Qn) 

kF=k 

=/.  •  -JV^  -  Px)  . . .  <*(?.-&) 

=  U>j...Jd{pl-Px). ..  dfa-QJ. 

That  is  U=1fUu  (61) 

whence  d  J7  =  Pi  n  *— l  dk.  (62) 

But  if  &  varies,  equations  (58)  and  (59)  give 

F=k+dk 

dU=J...Jdpl...dq.  (63) 

F=k+<a 
d£=f...  fce-'dfr  ...dqn  (64) 

Since  the  factor  Ce~F  has  the  constant  value  (7e~*  in  the 
last  multiple  integral,  we  have 

dX=Ce-*dU=CUlnerkkr*dk,  (65) 

*  =  -  CU^r* (l  +  k  +  ^  +  . . .  +  f|^rj)  +  const.      (66) 

We  may  determine  the  constant  of  integration  by  the  condition 
that  B  vanishes  with  *.    This  gives 
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(67) 


We  may  determine  the  value  of  the  constant  Ut  by  the  con- 
dition that  R  =  1  for  k  =s  oo.    This  gives  O  Ut  (n  =  I,  and 


=  !-«-*( 


1  +  *  +  £--+ff=r)'         <«*) 


2 

*=4-  (69) 

It  is  worthy  of  notice  that  the  form  of  these  equations  de- 
pends only  on  the  number  of  degrees  of  freedom  of  the  system, 
being  in  other  respects  independent  of  its  dynamical  nature, 
except  that  the  forces  must  be  functions  of  the  coordinates 
either  alone  or  with  the  time. 

If  we  write 

*~» 

for  the  value  of  k  which  substituted  in  equation  (68)  will  give 
R  =  \,  the  phases  determined  by  the  equation 

^•=4*4  (70) 

will  have  the  following  properties. 

The  probability  that  the  phase  falls  within  the  limits  formed 
by  these  phases  is  greater  than  the  probability  that  it  falls 
within  any  other  limits  enclosing  an  equal  extension-in-phase. 
It  is  equal  to  the  probability  that  the  phase  falls  without  the 
same  limits. 

These  properties  are  analogous  to  those  which  in  the  theory 
of  errors  in  the  determination  of  a  single  quantity  belong  to 
values  expressed  by  A  ±  a,  when  J.  is  the  most  probable 
value  and  a  the  *  probable  error.' 
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CHAPTER  III. 

APPLICATION  OF  THE  PRINCIPLE  OF  CONSERVATION  OF 
EXTENSION-IN-PHA8E  TO  THE  INTEGRATION  OF  THE 
DIFFERENTIAL  EQUATIONS  OF  MOTION.* 

We  have  seen  that  the  principle  of  conservation  of  exten- 
sion-in-phase  may  be  expressed  as  a  differential  relation  be- 
tween the  coordinates  and  momenta  and  the  arbitrary  constants 
of  the  integral  equations  of  motion.  Now  the  integration  of 
the  differential  equations  of  motion  consists  in  the  determina- 
tion of  these  constants  as  functions  of  the  coordinates  and 
momenta  with  the  time,  and  the  relation  afforded  by  the  prin- 
ciple of  conservation  of  extension-in-phase  may  assist  us  in 
this  determination. 

It  will  be  convenient  to  have  a  notation  which  shall  not  dis- 
tinguish between  the  coordinates  and  momenta.  If  we  write 
rx.  . .  r^  for  the  coordinates  and  momenta,  and  a  ...  h  as  be- 
fore for  the  arbitrary  constants,  the  principle  of  which  we 
wish  to  avail  ourselves,  and  which  is  expressed  by  equation 
(87),  may  be  written 

^^>  =  func.(a,...A).  (71) 

Let  us  first  consider  the  case  in  which  the  forces  are  deter- 
mined by  the  coordinates  alone.  Whether  the  forces  are 
Conservative'  or  not  is  immaterial.  Since  the  differential 
equations  of  motion  do  not  contain  the  time  (f)  in  the  finite 
form,  if  we  eliminate  dt  from  these  equations,  we  obtain  2  n  —  1 
equations  in  rx , . . .  ru  and  their  differentials,  the  integration 
of  which  will  introduce  2  n  —  1  arbitrary  constants  which  we 
shall  call  I ...  A.    If  we  can  effect  these  integrations,  the 

•  See  Boltzmann:  "  Zntammenbang  zwischen  den  S&tzen  fiber  das  Ver- 
halten  mehratomlger  Gaunoleciile  rait  Jacobi't  Princip  det  letxten  Multi- 
plica  tori.    SiUb.  der  Wiener  Akad.,Bd.  LXIII,  Abth.  II.,  S.  679,  (1871). 


Digitized  by  VjOOQIC 


THEORY  OF  INTEGRATION.  27 

remaining  constant  (a)  will  then  be  introduced  in  the  final 
integration,  (viz.,  that  of  an  equation  containing  eft,)  and  will 
be  added  to  or  subtracted  from  t  in  the  integral  equation. 
Let  us  have  it  subtracted  from  t    It  is  evident  then  that 

£— *»  £=-;"  etc-  <?*> 

Moreover,  since  6, ...  h  and  t  —  a  are  independent  functions 
of  rlf . . .  ffe,  the  latter  variables  are  functions  of  the  former. 
The  Jacobian  in  (71)  is  therefore  function  of  6, ...  A,  and 
t  —  a,  and  since  it  does  not  vary  with  t  it  cannot  vary  with  a. 
We  have  therefore  in  the  case  considered,  viz.,  where  the 
forces  are  functions  of  the  coordinates  alone, 

^^>  =  func.(*,...A).  (73) 

Now  let  us  suppose  that  of  the  first  2  n  —  1  integrations  we 
have  accomplished  all  but  one,  determining  2  n  —  2  arbitrary 
constants  (say  (?,...  A)  as  functions  of  rl9 . . .  r^,  leaving  J  as 
well  as  a  to  be  determined.  Our  2n  —  2  finite  equations  en- 
able us  to  regard  all  the  variables  rlf . . .  rto,  and  all  functions 
of  these  variables  as  functions  of  two  of  them,  (say  rx  and  rs,) 
with  the  arbitrary  constants  c, . . .  h.  To  determine  6,  we 
have  the  following  equations  for  constant  values  of  e, ...  A. 

whence  ^^-^  +  £*,  (74) 

Now,  by  the  ordinary  formula  for  the  change  of  variables, 
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where  the  limits  of  the  multiple  integrals  are  formed  by  the 
same  phases.    Hence 

d(rlfrt)  _  d(rt,  ...ru)    d(c,  . . .  A) 
4M)  ~    d(a,...h)    d(r%,...rj  </0' 

With  the  aid  of  this  equation,  which  is  an  identity,  and  (72), 
we  may  write  equation  (74)  in  the  form 

d(a,  ...h)  d(r9r  ...rSn)  v  v     ' 

The  separation  of  the  variables  is  now  easy.  The  differen- 
tial equations  of  motion  give  rx  and  r3  in  terms  of  rlf . . .  rte. 
The  integral  equations  already  obtained  give  (?,...  A  and 
therefore  the  Jacobian  d(c,  . . .  h)/d(rz, . . .  *•*,),  in  terms  of 
the  same  variables.  But  in  virtue  of  these  same  integral 
equations,  we  may  regard  functions  of  rt , . . .  r^  as  functions 
of  rx  and  r%  with  the  constants  (?,...  A.  If  therefore  we  write 
the  equation  in  the  form 

d(rl}  ...rte) rt  j> 5 ^ 

tf(q,...A)   **""    <*(«,...*)    ^        d(e,...h)  af»      (77) 
d(rt, . . .  **)  d(r„  . . .  r,,) 

the  coefficients  of  drt  and  dra  may  be  regarded  as  known  func- 
tions of  rx  and  r2  with  the  constants  (?,...  A.  The  coefficient 
of  db  is  by  (73)  a  function  of  6, ...  A.  It  is  not  indeed  a 
known  function  of  these  quantities,  but  since  <?,...  A  are 
regarded  as  constant  in  the  equation,  we  know  that  the  first 
member  must  represent  the  differential  of  some  function  of 
by ...  A,  for  which  we  may  write  V.    We  have  thus 

*"  =  d(c9  t:  .  A>" dfi  -  d(Cl ,?. :  a)  dr%>       (78) 

d(r99...r^  d{r$f  ...r*) 

which  may  be  integrated  by  quadratures  and  gives  V  as  func- 
tions of  rx,  ra, ...(?,...  A,  and  thus  as  function  of  rx,  .  .  .  rte. 
This  integration  gives  us  the  last  of  the  arbitrary  constants 
which  are  functions  of  the  coordinates  and  momenta  without 
the  time.    The  final  integration,  which  introduces  the  remain- 
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ing  constant  (a),  is  also  a  quadrature,  since  the  equation  to 
be  integrated  may  be  expressed  in  the  form 

dt  =  F(rx)drx. 

Now,  apart  from  any  such  considerations  as  have  been  ad- 
duced, if  we  limit  ourselves  to  the  changes  which  take  place 
in  time,  we  hare  identically 

f,**  —  fi  *•,  =  <>, 

and  rt  and  rs  are  given  in  terms  of  rv ...  r^  by  the  differential 
equations  of  motion.  When  we  have  obtained  2  n  —  2  integral 
equations,  we  may  regard  r3  and  rx  as  known  functions  of  rt 
and  rr  The  only  remaining  difficulty  is  in  integrating  this 
equation.  If  the  case  is  so  simple  as  to  present  no  difficulty, 
or  if  we  have  the  skill  or  the  good  fortune  to  perceive  that  the 
multiplier 

1 

d(e,...h)  '  (79) 

d(r§9 . . .  r*) 

or  any  other,  will  make  the  first  member  of  the  equation  an 
exact  differential,  we  have  no  need  of  the  rather  lengthy  con- 
siderations which  have  been  adduced.  The  utility  of  the 
principle  of  conservation  of  extension-in-phase  is  that  it  sup- 
plies a  *  multiplier '  which  renders  the  equation  integrable,  and 
which  it  might  be  difficult  or  impossible  to  find  otherwise. 

It  will  be  observed  that  the  function  represented  by  V  is  a 
particular  case  of  that  represented  by  b.  The  system  of  arbi- 
trary constants  a,  6*,  e . . .  h  has  certain  properties  notable  for 
simplicity.  If  we  write  V  for  b  in  (77),  and  compare  the 
result  with  (78),  we  get 

d(a,  bf,  c, ...  A)  v    ' 

Therefore  the  multiple  integral 

J.  .  .fdadb'de  ...dh  (81) 
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taken  within  limits  formed  by  phases  regarded  as  contempo- 
raneous represents  the  extension-in-phase  within  those  limits. 

The  case  is  somewhat  different  when  the  forces  are  not  de- 
termined by  the  coordinates  alone,  but  are  functions  of  the 
coordinates  with  the  time.  All  the  arbitrary  constants  of  the 
integral  equations  must  then  be  regarded  in  the  general  case 
as  functions  of  rv . . .  r^,  and  t.  We  cannot  use  the  princi- 
ple of  conservation  of  extension-in-phase  until  we  have  made 
2  n  —  1  integrations.  Let  us  suppose  that  the  constants  6, . . .  h 
have  been  determined  by  integration  in  terms  of  rv  . . .  r^,  and 
t,  leaving  a  single  constant  (a)  to  be  thus  determined.  Our 
2  n  —  1  finite  equations  enable  us  to  regard  all  the  variables 
rv  •  •  •  ra»  M  functions  of  a  single  one,  say  rv 

For  constant  values  of  6, ...  A,  we  have 

<&*  =  -£;  da +  rxdt.  (82) 

Now 

I  ...  f  -j —  da  dr%  . . .  dr*,  =  /.../  drx .  . .  dru 

-/•••/3fc3tf*-* 

where  the  limits  of  the  integrals  are  formed  by  the  same 
phases.    We  have  therefore 

dt\  _  d (r7 , . . .  ty,)  tf(6, ...  A) 

tftz        <f(a,  ...  A)  <f(rt,  .  .  .  raw) '  ^    ' 

by  which  equation  (82)  may  be  reduced  to  the  form 

d(ri,...rin)  1  rx 

d(af...h)    **-  d(b,...h)  dT*~   d(b9...h)   dU     W 
<*(**>  •••**»)  d(rv.  ..r2n) 

Now  we  know  by  (71)  that  the  coefficient  of  da  is  a  func- 
tion of  a, ...  A.  Therefore,  as  i, ...  A  are  regarded  as  constant 
in  the  equation,  the  first  number  represents  the  differential 
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of  a  function  of  a,  ...  A,  which  we  may  denote  by  d.  We 
have  then 

1  n' 

d(b, ... h)       l        d(b,  ...h)    ^  (85)- 

rf(rs,  ...r*,)  d(rs,  ...rto) 

which  may  be  integrated  by  quadratures.  In  this  case  we 
may  say  that  the  principle  of  conservation  of  extension-in- 
phase  has  supplied  the  *  multiplier  * 

1 
d(b,...h)  (86) 

d(rt, . . .  rto) 

for  the  integration  of  the  equation 

dn  —  rxdt  =  Q.  (87) 

The  system  of  arbitrary  constants  a7,  6, ...  A  has  evidently 
the  same  properties  which  were  noticed  in  regard  to  the 
system  a,  b\ ...  h. 
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CHAPTER  IV. 

ON  THE  DISTRIBUTION  IN  PHASE  CALLED  CANONICAL, 
IN  WHICH  THE  INDEX  OF  PROBABILITY  IS  A  LINEAR 
FUNCTION  OF  THE  ENERGY. 

Let  us  now  give  our  attention  to  the  statistical  equilibrium 
of  ensembles  of  conservation  systems,  especially  to  those  cases 
and  properties  which  promise  to  throw  light  on  the  phenom- 
ena of  thermodynamics. 

The  condition  of  statistical  equilibrium  may  be  expressed 
in  the  form* 

*(&»+%  »H>  <» 

where  P  is  the  coefficient  of  probability,  or  the  quotient  of 
the  density-in-phase  by  the  whole  number  of  systems.  To 
satisfy  this  condition,  it  is  necessary  and  sufficient  that  P 
should  be  a  function  of  the  p's  and  j's  (the  momenta  and 
coordinates)  which  does  not  vary  with  the  time  in  a  moving 
system.  In  all  cases  which  we  are  now  considering,  the 
energy,  or  any  function  of  the  energy,  is  such  a  function. 

P=func.  (c) 

will  therefore  satisfy  the  equation,  as  indeed  appears  identi- 
cally if  we  write  it  in  the  form 


fdP  d^_dP  &_\ 

W  dfr      d^  dqj  "" 


There  are,  however,  other  conditions  to  which  P  is  subject, 
which  are  not  so  much  conditions  of  statistical  ^equilibrium,  as 
conditions  implicitly  involved  in  the  definition  of  the  coeffi- 

*  See  equations  (20),  (41),  (42),  also  the  paragraph  following  equation  (20). 
The  positions  of  any  external  bodies  which  can  affect  the  systems  are  here 
supposed  uniform  for  all  the  systems  and  constant  in  time. 
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cient  of  probability,  whether  the  case  is  one  of  equilibrium 
or  not.    These  are:  that  P  should  be  single* valued,  and 
neither  negative  nor  imaginary  for  any  phase,  and  that  ex- 
pressed by  equation  (46),  viz., 
an 
f...fpdpl...dq.  =  l.  (89) 


,-iogp-*;e, 

(90) 

t=5 

(91) 

These  considerations  exclude 

P=  e  X  constant, 

as  well  as 

P=  constant, 

as  cases  to  be  considered. 
The  distribution  represented  by 


or 


where  6  and  yfr  are  constants,  and  ©  positive,  seems  to  repre- 
sent the  most  simple  case  conceivable,  since  it  has  the  property 
that  when  the  system  consists  of  parts  with  separate  energies, 
the  laws  of  the  distribution  in  phase  of  the  separate  parts  are 
of  the  same  nature, —  a  property  which  enormously  simplifies 
the  discussion,  and  is  the  foundation  of  extremely  important 
relations  to  thermodynamics.  The  case  is  not  rendered  less 
simple  by  the  divisor  8,  (a  quantity  of  the  same  dimensions  as 
e,)  but  the  reverse,  since  it  makes  the  distribution  independent 
of  the  units  employed.  The  negative  sign  of  €  is  required  by 
(89),  which  determines  also  the  value  of  sfr  for  any  given 

©,  viz., 

f  an  « 

e  •  =/.../  e  *dpx .  ..dqn.  (92) 


When  an  ensemble  of  systems  is  distributed  in  phase  in  the 
manner  described,  i.  *.,  when  the  index  of  probability  is  a 
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linear  function  of  the  energy,  we  shall  say  that  the  ensemble  is 
canvnicaXly  distributed,  and  shall  call  the  divisor  of  the  energy 
(©)  the  modulus  of  distribution. 

The  fractional  part  of  an  ensemble  canonically  distributed 
which  lies  within  any  given  limits  of  phase  is  therefore  repre- 
sented by  the  multiple  integral 


/•••/ 


>*  dfr.^dq.  (93) 


taken  within  those  limits.  We  may  express  the  same  thing 
by  saying  that  the  multiple  integral  expresses  the  probability 
that  an  unspecified  system  of  the  ensemble  (i.  e.,  one  of 
which  we  only  know  that  it  belongs  to  the  ensemble)  falls 
within  the  given  limits. 

Since  the  value  of  a  multiple  integral  of  the  form  (23) 
(which  we  have  called  an  extension-in-phase)  bounded  by  any 
given  phases  is  independent  of  the  system  of  coordinates  by 
which  it  is  evaluated,  the  same  must  be  true  of  the  multiple 
integral  in  (92),  as  appears  at  once  if  we  divide  up  this 
integral  into  parts  so  small  that  the  exponential  factor  may  be 
regarded  as  constant  in  each.  The  value  of  ^  is  therefore  in- 
dependent of  the  system  of  coordinates  employed. 

It  is  evident  that  yfr  might  be  defined  as  the  energy  for 
which  the  coefficient  of  probability  of  phase  has  the  value 
unity.  Since  however  this  coefficient  has  the  dimensions  of 
the  inverse  nth  power  of  the  product  of  energy  and  time,* 
the  energy  represented  by  ^r  is  not  independent  of  the  units 
of  energy  and  time.  But  when  these  units  have  been  chosen, 
the  definition  of  ^  will  involve  the  same  arbitrary  constant  as 
e,  so  that,  while  in  any  given  case  the  numerical  values  of 
^  or  e  will  be  entirely  indefinite  until  the  zero  of  energy  has 
also  been  fixed  for  the  system  considered,  the  difference  yfr  —  € 
will  represent  a  perfectly  definite  amount  of  energy,  which  is 
entirely  independent  of  the  zero  of  energy  which  we  may 
choose  to  adopt 

•  See  Chapter  I,  p.  19. 
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It  is  evident  that  the  canonical  distribution  is  entirely  deter- 
mined by  the  modulus  (considered  as  a  quantity  of  energy) 
and  the  nature  of  the  system  considered,  since  when  equation 
(92)  is  satisfied  the  value  of  the  multiple  integral  (93)  is 
independent  of  the  units  and  of  the  coordinates  employed,  and 
of  the  zero  chosen  for  the  energy  of  the  system. 

In  treating  of  the  canonical  distribution,  we  shall  always 
suppose  the  multiple  integral  in  equation  (92)  to  have  a 
finite  value,  as  otherwise  the  coefficient  of  probability  van- 
ishes, and  the  law  of  distribution  becomes  illusory.  This  will 
exclude  certain  cases,  but  not  such  apparently,  as  will  affect 
the  value  of  our  results  with  respect  to  their  bearing  on  ther- 
modynamics. It  will  exclude,  for  instance,  cases  in  which  the 
system  or  parts  of  it  can  be  distributed  in  unlimited  space 
(or  in  a  space  which  has  limits,  but  is  still  infinite  in  volume), 
while  the  energy  remains  beneath  a  finite  limit.  It  also 
excludes  many  cases  in  which  the  energy  can  decrease  without 
limit,  as  when  the  system  contains  material  points  which 
attract  one  another  inversely  as  the  squares  of  their  distances. 
Cases  of  material  points  attracting  each  other  inversely  as  the 
distances  would  be  excluded  for  some  values  of  8,  and  not 
for  others.  The  investigation  of  such  points  is  best  left  to 
the  particular  cases.  For  the  purposes  of  a  general  discussion, 
it  is  sufficient  to  call  attention  to  the  assumption  implicitly 
involved  in  the  formula  (92).* 

The  modulus  8  has  properties  analogous  to  those  of  tem- 
perature in  thermodynamics.  Let  the  system  A  be  defined  as 
one  of  an  ensemble  of  systems  of  m  degrees  of  freedom 
distributed  in  phase  with  a  probability-coefficient 

e   e    , 

*  It  wiU  be  obserred  that  similar  limitations  exist  in  thermodynamics.  In 
order  that  a  mass  of  gas  can  be  in  thermodynamic  equilibrium,  it  is  necessary 
that  it  be  enclosed.  There  is  no  thermodynamic  equilibrium  of  a  (finite)  mats 
of  gas  in  an  infinite  space.  Again,  that  two  attracting  particles  should  be 
able  to  do  an  infinite  amount  of  work  in  passing  from  one  configuration 
(which  is  regarded  as  possible)  to  another,  is  a  notion  which,  although  per- 
fectly intelligible  in  a  mathematical  formula,  is  quite  foreign  to  our  ordinary 
conceptions  of  matter. 
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and  the  system  B  as  one  of  an  ensemble  of  systems  of  n 
degrees  of  freedom  distributed  in  phase  with  a  probability- 
coefficient 

e   e    , 

which  has  the  same  modulus.  Let  qv  . .  •;„,  pv . . .  pm  be  the 
coordinates  and  momenta  of  -4,  and  qm¥1 , . . .  $***>  Pm+i  >  •  •  •  2?**+* 
those  of  B.  Now  we  may  regard  the  systems  A  and  B  as 
together  forming  a  system  C,  having  t»  +  *  degrees  of  free- 
dom, and  the  coordinates  and  momenta  }19 . . .  j^,  Pv*  Pm+*- 
The  probability  that  the  phase  of  the  system  C,  as  thus  defined, 
will  fall  within  the  limits 

dpx , . . .  cJp»+«i  &9\  9  •  •  •  <%*+* 

is  evidently  the  product  of  the  probabilities  that  the  systems 
A  and  B  will  each  fall  within  the  specified  limits,  viz., 

f^gru  (94) 

We  may  therefore  regard  C  as  an  undetermined  system  of  an 
ensemble  distributed  with  the  probability-coefficient 

Wr+A+iM)  (95) 

•  e  9 

an  ensemble  which  might  be  defined  as  formed  by  combining 
each  system  of  the  first  ensemble  with  each  of  the  second. 
But  since  eA  +  eB  is  the  energy  of  the  whole  system,  and 
yfrA  and  yfrB  are  constants,  the  probability-coefficient  is  of  the 
general  form  which  we  are  considering,  and  the  ensemble  to 
which  it  relates  is  in  statistical  equilibrium  and  is  canonically 
distributed. 

This  result,  however,  so  far  as  statistical  equilibrium  is 
concerned,  is  rather  nugatory,  since  conceiving  of  separate 
systems  as  forming  a  single  system  does  not  create  any  in- 
teraction between  them,  and  if  the  systems  combined  belong  to 
ensembles  in  statistical  equilibrium,  to  say  that  the  ensemble 
formed  by  such  combinations  as  we  have  supposed  is  in  statis- 
tical   equilibrium,  is  only  to  repeat  the  data  in  different 
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words.  Let  us  therefore  suppose  that  in  forming  the  system 
C  we  add  certain  forces  acting  between  A  and  2?,  and  having 
the  force-function  —  c^«  The  energy  of  the  system  0  is  now 
€a  +  €b  +  €a&  and  an  ensemble  of  such  systems  distributed 
with  a  density  proportional  to 

llSd^Sri  (96) 

would  be  in  statistical  equilibrium.  Comparing  this  with  the 
probability-coefficient  of  C  given  above  (95),  we  see  that  if 
we  suppose  e**  (or  rather  the  variable  part  of  this  term  when 
we  consider  all  possible  configurations  of  the  systems  A  and  B) 
to  be  infinitely  small,  the  actual  distribution  in  phase  of  C 
will  differ  infinitely  little  from  one  of  statistical  equilibrium, 
which  is  equivalent  to  saying  that  its  distribution  in  phase 
will  vary  infinitely  little  even  in  a  time  indefinitely  prolonged.* 
The  case  would  be  entirely  different  if  A  and  B  belonged  to 
ensembles  having  different  moduli,  say  %A  and  6*.  The  prob- 
ability-coefficient of  C  would  then  be 

which  is  not  approximately  proportional  to  any  expression  of 
the  form  (96). 

Before  proceeding  farther  in  the  investigation  of  the  dis- 
tribution in  phase  which  we  have  called  canonical,  it  will  be 
interesting  to  see  whether  the  properties  with  respect  to 

*  It  will  be  observed  that  the  above  condition  relating  to  the  forces  which 
act  between  the  different  systems  is  entirely  analogous  to  that  which  must 
hold  in  the  corresponding  case  in  thermodynamics.  The  most  simple  test 
of  the  equality  of  temperature  of  two  bodies  is  that  they  remain  in  equilib- 
rium when  brought  into  thermal  contact  Direct  thermal  contact  implies 
molecular  forces  acting  between  the  bodies.  Now  the  test  will  fail  unless 
the  energy  of  these  forces  can  be  neglected  in  comparison  with  the  other 
energies  of  the  bodies.  Thus,  in  the  case  of  energetic  chemical  action  be- 
tween the  bodies,  or  when  the  number  of  particles  affected  by  the  forces 
acting  between  the  bodies  is  not  negligible  in  comparison  with  the  whole 
number  of  particles  (as  when  the  bodies  have  the  form  of  exceedingly  thin 
sheets),  the  contact  of  bodies  of  the  same  temperature  may  produce  con- 
siderable thermal  disturbance,  and  thus  fail  to  afford  a  reliable  criterion  of 
the  equality  of  temperature. 
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statistical  equilibrium  which  have  been  described  are  peculiar 
to  it,  or  whether  other  distributions  may  have  analogous 
properties. 

Let  if  and  if1  be  the  indices  of  probability  in  two  independ- 
ent ensembles  which  are  each  in  statistical  equilibrium,  then 
if  +  if'  will  be  the  index  in  the  ensemble  obtained  by  combin- 
ing each  system  of  the  first  ensemble  with  each  system  of  the 
second.  This  third  ensemble  will  of  course  be  in  statistical 
equilibrium,  and  the  function  of  phase  if  +  rf1  will  be  a  con- 
stant of  motion.  Now  when  infinitesimal  forces  are  added  to 
the  compound  systems,  if  if  +  if1  or  a  function  differing 
infinitesimally  from  this  in  still  a  constant  of  motion,  it  must 
be  on  account  of  the  nature  of  the  forces  added,  or  if  their  action 
is  not  entirely  specified,  on  account  of  conditions  to  which 
they  are  subject.  Thus,  in  the  case  already  considered, 
if  +  if1  is  a  function  of  the  energy  of  the  compound  system, 
and  the  infinitesimal  forces  added  are  subject  to  the  law  of 
conservation  of  energy. 

Another  natural  supposition  in  regard  to  the  added  forces 
is  that  they  should  be  such  as  not  to  affect  the  moments  of 
momentum  of  the  compound  system.  To  get  a  case  in  which 
moments  of  momentum  of  the  compound  system  shall  be 
constants  of  motion,  we  may  imagine  material  particles  con- 
tained in  two  concentric  spherical  shells,  being  prevented  from 
passing  the  surfaces  bounding  the  shells  by  repulsions  acting 
always  in  lines  passing  through  the  common  centre  of  the 
shells.  Then,  if  there  are  no  forces  acting  between  particles  in 
different  shells,  the  mass  of  particles  in  each  shell  will  have, 
besides  its  energy,  the  moments  of  momentum  about  three 
axes  through  the  centre  as  constants  of  motion. 

Now  let  us  imagine  an  ensemble  formed  by  distributing  in 
phase  the  system  of  particles  in  one  shell  according  to  the 
index  of  probability 

^--  +  ^-  +  ^-  +  ^,  (98) 

where  €  denotes  the  energy  of  the  system,  and  o>1,  «>a,  o>8,  its 
three  moments  of  momentum,  and  the  other  letters  constants. 
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In  like  manner  let  us  imagine  a  second  ensemble  formed  by 
distributing  in  phase  the  system  of  particles  in  the  other  shell 
according  to  the  index 

where  the  letters  have  similar  significations,  and  6, 0lt  fl2,  fl8 
the  same  values  as  in  the  preceding  formula.  Each  of  the 
two  ensembles  will  evidently  be  in  statistical  equilibrium,  and 
therefore  also  the  ensemble  of  compound  systems  obtained  by 
combining  each  system  of  the  first  ensemble  with  each  of  the 
second.    In  this  third  ensemble  the  index  of  probability  will  be 

*+*-e^+Hr+^+i*ir>  (100) 

where  the  four  numerators  represent  functions  of  phase  which 
are  constants  of  motion  for  the  compound  systems. 

Now  if  we  add  in  each  system  of  this  third  ensemble  infini- 
tesimal conservative  forces  of  attraction  or  repulsion  between 
particles  in  different  shells,  determined  by  the  same  law  for 
all  the  systems,  the  functions  a>t  +  o>',  o>3  +  a>8',  and  o>8  +  o>8' 
will  remain  constants  of  motion,  and  a  function  differing  in- 
finitely little  from  et  +  e'  will  be  a  constant  of  motion.  It 
would  therefore  require  only  an  infinitesimal  change  in  the 
distribution  in  phase  of  the  ensemble  of  compound  systems  to 
make  it  a  case  of  statistical  equilibrium.  These  properties  are 
entirely  analogous  to  those  of  canonical  ensembles.* 

Again,  if  the  relations  between  the  forces  and  the  coordinates 
can  be  expressed  by  linear  equations,  there  will  be  certain 
"  normal "  types  of  vibration  of  which  the  actual  motion  may 
be  regarded  as  composed,  and  the  whole  energy  may  be  divided 

*  It  would  not  be  possible  to  omit  the  term  relating  to  energy  in  the  above 
indices,  since  without  this  term  the  condition  expressed  by  equation  (89) 
cannot  be  satisfied. 

The  consideration  of  the  above  case  of  statistical  equilibrium  may  be 
made  the  foundation  of  the  theory  of  the  thermodynamic  equilibrium  of 
rotating  bodies,  —  a  subject  which  has  been  treated  by  Maxwell  in  his  memoir 
"  On  Boltzmann's  theorem  on  the  average  distribution  of  energy  in  a  system 
of  material  points/'    Cambr.  Phil  Trans.,  toL  XII,  p.  647,  (1878). 
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into  parts  relating  separately  to  vibrations  of  these  different 
types.  These  partial  energies  will  be  constants  of  motion, 
and  if  such  a  system  is  distributed  according  to  an  index 
which  is  any  function  of  the  partial  energies,  the  ensemble  will 
be  in  statistical  equilibrium.  Let  the  index  be  a  linear  func- 
tion of  the  partial  energies,  say 

A-k---k-  (101) 

Let  us  suppose  that  we  have  also  a  second  ensemble  com- 
posed of  systems  in  which  the  forces  are  linear  functions  of 
the  coordinates,  and  distributed  in  phase  according  to  an  index 
which  is  a  linear  function  of  the  partial  energies  relating  to 
the  normal  types  of  vibration,  say 

*-*?-—&•  (102) 

Since  the  two  ensembles  are  both  in  statistical  equilibrium, 
the  ensemble  formed  by  combining  each  system  of  the  first 
with  each  system  of  the  second  will  also  be  in  statistical 
equilibrium.  Its  distribution  in  phase  will  be  represented  by 
the  index 

A+A'-k--k-£<--£»      (103) 

and  the  partial  energies  represented  by  the  numerators  in  the 
formula  will  be  constants  of  motion  of  the  compound  systems 
which  form  this  third  ensemble. 

Now  if  we  add  to  these  compound  systems  infinitesimal 
forces  acting  between  the  component  systems  and  subject  to 
the  same  general  law  as  those  already  existing,  viz.,  that  they 
are  conservative  and  linear  functions  of  the  coordinates,  there 
will  stall  be  n  +  m  types  of  normal  vibration,  and  n  +  m 
partial  energies  which  are  independent  constants  of  motion. 
If  all  the  original  n  +  wi  normal  types  of  vibration  have  differ- 
ent periods,  the  new  types  of  normal  vibration  will  differ  infini- 
tesimally  from  the  old,  and  the  new  partial  energies,  which  are 
constants  of  motion,  will  be  nearly  the  same  functions  of 
phase  as  the  old.    Therefore  the  distribution  in  phase  of  the 
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ensemble  of  compound  systems  after  the  addition  of  the  sup- 
posed infinitesimal  forces  will  differ  infinitesimally  from  one 
which  would  be  in  statistical  equilibrium. 

The  case  is  not  so  simple  when  some  of  the  normal  types  of 
motion  have  the  same  periods.  In  this  case  the  addition  of 
infinitesimal  forces  may  completely  change  the  normal  types 
of  motion.  But  the  sum  of  the  partial  energies  for  all  the 
original  types  of  vibration  which  have  any  same  period,  will 
be  nearly  identical  (as  a  function  of  phase,  i.  «.,  of  the  coordi- 
nates and  momenta,)  with  the  sum  of  the  partial  energies  for 
the  normal  types  of  vibration  which  have  the  same,  or  nearly 
the  same,  period  after  the  addition  of  the  new  forces.  If, 
therefore,  the  partial  energies  in  the  indices  of  the  first  two 
ensembles  (101)  and  (102)  which  relate  to  types  of  vibration 
having  the  same  periods,  have  the  same  divisors,  the  same  will 
be  true  of  the  index  (103)  of  the  ensemble  of  compound  sys- 
tems, and  the  distribution  represented  will  differ  infinitesimally 
from  one  which  would  be  in  statistical  equilibrium  after  the 
addition  of  the  new  forces.* 

The  same  would  be  true  if  in  the  indices  of  each  of  the 
original  ensembles  we  should  substitute  for  the  term  or  terms 
relating  to  any  period  which  does  not  occur  in  the  other  en- 
semble, any  function  of  the  total  energy  related  to  that  period, 
subject  only  to  the  general  limitation  expressed  by  equation 
(89).  But  in  order  that  the  ensemble  of  compound  systems 
(with  the  added  forces)  shall  always  be  approximately  in 
statistical  equilibrium,  it  is  necessary  that  the  indices  of  the 
original  ensembles  should  be  linear  functions  of  those  partial 
energies  which  relate  to  vibrations  of  periods  common  to  the 
two  ensembles,  and  that  the  coefficients  of  such  partial  ener- 
gies should  be  the  same  in  the  two  indices. f 

*  It  it  interesting  to  compare  the  above  relations  with  the  laws  respecting 
the  exchange  of  energy  between  bodies  by  radiation,  although  the  phenomena 
of  radiations  lie  entirely  without  the  scope  of  the  present  treatise,  in  which 
the  discussion  is  limited  to  systems  of  a  finite  number  of  degrees  of  freedom. 

t  The  above  may  perhaps  be  sufficiently  illustrated  by  the  simple  case 
where  n  =  1  in  each  system.  If  the  periods  are  different  in  the  two  systems, 
they  may  be  distributed  according  to  any  functions  of  the  energies :  but  if 
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The  properties  of  canonically  distributed  ensembles  of 
systems  with  respect  to  the  equilibrium  of  the  new  ensembles 
which  may  be  formed  by  combining  each  system  of  one  en- 
semble with  each  system  of  another,  (are  therefore  not  peculiar 
to  them  in  the  sense  that  analogous  properties  do  not  belong 
to  some  other  distributions  under  special  limitations  in  regard 
to  the  systems  and  forces  considered.  Yet  the  canonical 
distribution  evidently  constitutes  the  most  simple  case  of  the 
kind,  and  that  for  which  the  relations  described  hold  with  the 
least  restrictions. 

Returning  to  the  case  of  the  canonical  distribution,  we 
shall  find  other  analogies  with  thermodynamic  systems,  if  we 
suppose,  as  in  the  preceding  chapters,*  that  the  potential 
energy  (eq)  depends  not  only  upon  the  coordinates  qx . . .  qn 
which  determine  the  configuration  of  the  system,  but  also 
upon  certain  coordinates  <*i,  at,  etc.  of  bodies  which  we  call 
external,  meaning  by  this  simply  that  they  are  not  to  be  re- 
garded as  forming  any  part  of  the  system,  although  their 
positions  affect  the  forces  which  act  on  the  system.  The 
forces  exerted  by  the  system  upon  these  external  bodies  will 
be  represented  by  —  deq\dav  —  deqldav  etc.,  while  —  deq/dqv 
.  . .  —  dcq/dqn  represent  all  the  forces  acting  upon  the  bodies 
of  the  system,  including  those  which  depend  upon  the  position 
of  the  external  bodies,  as  well  as  those  which  depend  only 
upon  the  configuration  of  the  system  itself.  It  will  be  under- 
stood that  e,  depends  only  upon  ji, . . . qu,  pif .. .pni  in  other 
words,  that  the  kinetic  energy  of  the  bodies  which  we  call 
external  forms  no  part  of  the  kinetic  energy  of  the  system. 
It  follows  that  we  may  write 

de        dc 

^=4=-^'  (104) 

although  a  similar  equation  would  not  hold  for  differentiations 
relative  to  the  internal  coordinates. 

the  periods  are  the  same  they  most  be  distributed  canonically  with  same 
modulus  in  order  that  the  compound  ensemble  with  additional  forces  may 
be  in  statistical  equilibrium. 
*  See  especially  Chapter  I,  p.  4. 
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We  always  suppose  these  external  coordinates  to  have  the 
same-values  for  all  systems  of  any  ensemble.  In  the  case  of 
a  canonical  distribution,  u  «.,  when  the  index  of  probability 
of  phase  is  a  linear  function  of  the  energy,  it  is  evident  that 
the  values  of  the  external  coordinates  will  affect  the  distribu- 
tion, since  they  affect  the  energy.    In  the  equation 

e  e=  /•••/*  e4fr- ••<*&•>  (106) 


by  which  yfr  may  be  determined,  the  external  coordinates,  ax> 
a,,  etc.,  contained  implicitly  in  e,  as  well  as  ©,  are  to  be  re- 
garded as  constant  in  the  integrations  indicated.  The  equa- 
tion indicates  that  ^r  is  a  function  of  these  constants.  If  we 
imagine  their  values  varied,  and  the  ensemble  distributed 
canonically  according  to  their  new  values,  we  have  by 
differentiation  of  the  equation 

phattt 
phaftM 

"5*v •••/£•  e^---^--et°-'    (io6> 


or,  multiplying  by  0  ee,  and  setting 

-d#  +  j£d®  =  -d®f...ieee  dpx...dqm 


+  da1f...  f^ie  e  djh  . . .  dqm 


•n  f-€ 

+  dot  J  .*-JAt6edpl...dqn    +  etc.       (107) 

phttM 
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Now  the  average  value  in  the  ensemble  of  any  quantity 
(which  we  shall  denote  in  general  by  a  horizontal  line  above 
the  proper  symbol)  is  determined  by  the  equation 

5  =J  .  •  Jue~*~dpl  ...dq%.  (108) 


Comparing  this  with  the  preceding  equation,  we  have 


<fy  =  | &% - 1  d%  -  lx  dax  -  Itiat  -  etc  (109) 


1 
Or,  since  Z-JP  =  7,  (110) 


0 

=  7> 

f_C 

• 

0 

—  7> 

and  0    =^  <1U> 

<fy  =  ^  <*0  —  -2i  dax  —  J,  do,  —  etc  (112) 

Moreover,  since  (111)  gives 

d$  —  d*  =  e^  +  ^<*0,  (113) 

we  have  also 

(£  =  —  0  cfi;  —  Ixdoi  —  Itdot  —  etc  (114) 

This  equation,  if  we  neglect  the  sign  of  averages,  is  identi- 
cal in  form  with  the  thermodynamic  equation 

,        de  +  Ax  dai  +  A%  dat  +  etc 
dv  = ^ 1  (115) 

or 

de  =  Tdtf  —  Axdai—Atdai  —  etc,  (116) 

which  expresses  the  relation  between  the  energy,  tempera- 
ture, and  entropy  of  a  body  in  thermodynamic  equilibrium, 
and  the  forces  which  it  exerts  on  external  bodies,  —  a  relation 
which  is  the  mathematical  expression  of  the  second  law  of 
thermodynamics  for  reversible  changes.  The  modulus  in  the 
statistical  equation  corresponds  to  temperature  in  the  thermo- 
dynamic equation,  and  the  average  index  of  probability  with 
its  sign  reversed  corresponds  to  entropy.  But  in  the  thermo- 
dynamic equation  the  entropy  (17)  is  a  quantity  which  is 
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only  defined  by  the  equation  itself,  and  incompletely  defined 
in  that  the  equation  only  determines  its  differential,  and  the 
constant  of  integration  is  arbitrary.  On  the  other  hand,  the 
j}  in  the  statistical  equation  has  been  completely  defined  as 
the  average  value  in  a  canonical  ensemble  of  systems  of 
the  logarithm  of  the  coefficient  of  probability  of  phase. 

We  may  also  compare  equation  (112)  with  the  thermody- 
namic equation 

f  =  —  ydT  —  Ax  dax  —  A%dat  —  etc,  (117) 

where  ^r  represents  the  function  obtained  by  subtracting  the 
product  of  the  temperature  and  entropy  from  the  energy. 

How  far,  or  in  what  sense,  the  similarity  of  these  equations 
constitutes  any  demonstration  of  the  thermodynamic  equa- 
tions, or  accounts  for  the  behavior  of  material  systems,  as 
described  in  the  theorems  of  thermodynamics,  is  a  question 
of  which  we  shall  postpone  the  consideration  until  we  have 
further  investigated  the  properties  of  an  ensemble  of  systems 
distributed  in  phase  according  to  the  law  which  we  are  con- 
sidering. The  analogies  which  have  been  pointed  out  will  at 
least  supply  the  motive  for  this  investigation,  which  will 
naturally  commence  with  the  determination  of  the  average 
values  in  the  ensemble  of  the  most  important  quantities  relating 
to  the  systems,  and  to  the  distribution  of  the  ensemble  with 
respect  to  the  different  values  of  these  quantities. 
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CHAPTER  V. 

AVERAGE  VALUES  IN  A  CANONICAL  ENSEMBLE 
OF  SYSTEMS. 

In  the  simple  but  important  case  of  a  system  of  material 
points,  if  we  use  rectangular  coordinates,  we  have  for  the 
product  of  the  differentials  of  the  coordinates 

dxxdyid»i . .  .  dxydyvdzn 

and  for  the  product  of  the  differentials  of  the  momenta 

n^dxitnidyxm^dzx  . . .  mvdxvmvdyvmvdzv. 

The  product  of  these  expressions,  which  represents  an  element 
of  extension-in-phase,  may  be  briefly  written 

nil  dx\  •  •  •  nip  dzy  dxi  •  •  •  dz*  ; 

and  the  integral 


/•••/ 


e  e  midxx . . .  nhdzydxi . . .  dz,  (118) 

will  represent  the  probability  that  a  system  taken  at  random 
from  an  ensemble  canonically  distributed  will  fall  within  any 
given  limits  of  phase. 
In  this  case 

(119) 
and 


<=«« 

+  Jm,*!1 

i 

•    •   • 

+  \mvz\\ 

=  e  e  e 

26 

.  ..  6      28 

(120) 

The  potential  eneTgy  (eq)  is  independent  of  the  velocities, 
and  if  the  limits  of  integration  for  the  coordinates  are  inde- 
pendent of  the  velocities,  and  the  limits  of  the  several  veloci- 
ties are  independent  of  each  other  as  well  as  of  the  coordinates, 
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the  multiple  integral  may  be  resolved  into  the  product  of 
integrals 

f-tg  »!*!*  *£* 

f...feedx1...d^Ce    2e  fihdii . . . fe    2e  f%di*.  (121) 

This  shows  that  the  probability  that  the  configuration  lies 
within  any  given  limits  is  independent  of  the  velocities, 
and  that  the  probability  that  any  component  velocity  lies 
within  any  given  limits  is  independent  of  the  other  component 
velocities  and  of  the  configuration. 
Since 


and 


/r. 


0     *9m1dxl  =  V2**H*,  C122) 


/:: 


the  average  value  of  the  part  of  the  kinetic  energy  due  to  the 
velocity  £#  which  is  expressed  by  the  quotient  of  these  inte- 
grals, is  J  O.  This  is  true  whether  the  average  is  taken  for 
the  whole  ensemble  or  for  any  particular  configuration, 
whether  it  is  taken  without  reference  to  the  other  component 
velocities,  or  only  those  systems  are  considered  in  which  the 
other  component  velocities  have  particular  values  or  lie 
within  specified  limits. 

The  number  of  coordinates  is  8  v  or  n.  We  have,  therefore, 
for  the  average  value  of  the  kinetic  energy  of  a  system 

^  =  }ve  =  jn®.  (124) 

This  is  equally  true  whether  we  take  the  average  for  the  whole 
ensemble,  or  limit  the  average  to  a  single  configuration. 

The  distribution  of  the  systems  with  respect  to  their  com- 
ponent velocities  follows  the  *  law  of  errors * ;  the  probability 
that  the  value  of  any  component  velocity  lies  within  any  given 
limits  being  represented  by  the  value  of  the  corresponding 
integral  in  (121)  for  those  limits,  divided  by  (2?rm6)i, 
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which  is  the  value  of  the  same  integral  for  infinite  limits. 
Thus  the  probability  that  the  value  of  a^  lies  between  any 
given  limits  is  expressed  by 

28  d^.  (126) 


wts- 


The  expression  becomes  more  simple  when  the  velocity  is 
expressed  with  reference  to  the  energy  involved.    If  we  set 


•-(*•)*» 


the    probability  that  s    lies  between    any  given   limits  is 
expressed  by 


1    r-+ 


V5J-    -  °*> 

Here  *  is  the  ratio  of  the  component  velocity  to  that  which 
would  give  the  energy  © ;  in  other  words,  &  is  the  quotient 
of  the  energy  due  to  the  component  velocity  divided  by  ©. 
The  distribution  with  respect  to  the  partial  energies  due  to 
the  component  velocities  is  therefore  the  same  for  all  the  com- 
ponent velocities. 

The  probability  that  the  configuration  lies  within  any  given 
limits  is  expressed  by  the  value  of 

Jf*  (2ir®)2  C...  Ce  e  <&,...  dz,  (127) 

for  those  limits,  where  M  denotes  the  product  of  all  the 
masses.  This  is  derived  from  (121)  by  substitution  of  the 
values  of  the  integrals  relating  to  velocities  taken  for  infinite 
limits. 

Very  similar  results  may  be  obtained  in  the  general  case  of 
a  conservative  system  of  n  degrees  of  freedom.  Since  ep  is  a 
homogeneous  quadratic  function  of  the  p%  it  may  be  divided 
into  parts  by  the  formula 


Digitized  by  VjOOQIC 


ENSEMBLE  OF  SYSTEMS.  49 

where  *  might  be  written  for  ep  in  the  differential  coefficients 
without  affecting  the  signification.  The  average  value  of  the 
first  of  these  parts,  for  any  given  configuration,  is  expressed 
by  the  quotient 

£ (129) 

I  ...  I        e  "  opi . . .  dpm 

J  —mj  — » 

Now  we  have  by  integration  by  parts 

By  substitution  of  this  value,  the  above  quotient  reduces  to 

^,  which  is  therefore  the  average  value  of  bPi-r-  for  the 
Z  dpx 

given  configuration.  Since  this  value  is  independent  of  the 
configuration,  it  must  also  be  the  average  for  the  whole 
ensemble,  as  might  easily  be  proved  directly.  (To  make 
the  preceding  proof  apply  directly  to  the  whole  ensemble,  we 
have  only  to  write  dpx  .  •  •  dqn  for  dpx  • .  •  dpn  in  the  multiple 
integrals.)  This  gives  £  n  &  for  the  average  value  of  the 
whole  kinetic  energy  for  any  given  configuration,  or  for 
the  whole  ensemble,  as  has  already  been  proved  in  the  case  of 
material  points. 

The  mechanical  significance  of  the  several  parts  into  which 
the  kinetic  energy  is  divided  in  equation  (128)  will  be  appar- 
ent if  we  imagine  that  by  the  application  of  suitable  forces 
(different  from  those  derived  from  eq  and  so  much  greater 
that  the  latter  may  be  neglected  in  comparison)  the  system 
was  brought  from  rest  to  the  state  of  motion  considered,  so 
rapidly  that  the  configuration  was  not  sensibly  altered  during 
the  process,  and  in  such  a  manner  also  that  the  ratios  of  the 
component  velocities  were  constant  in  the  process.    If  we 

write 

Fxdqx...  +  Fnd$n 
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for  the  moment  of  these  forces,  we  have  for  the  period  of  their 
action  by  equation  (8) 

The  work  done  by  the  force  Ft  may  be  evaluated  as  follows: 

where  the  last  term  may  be  cancelled  because  the  configuration 
does  not  vary  sensibly  during  the  application  of  the  forces. 
(It  will  be  observed  that  the  other  terms  contain  factors  which 
increase  as  the  time  of  the  action  of  the  forces  is  diminished.) 
We  have  therefore, 

Jfx  dqx  =Jpx  ii  dt  =  fix  dp  = Jj  Jjft  dfr.  (131) 

For  since  the  jp's  are  linear  functions  of  the  j's  (with  coeffi- 
cients involving  the  qy&)  the  supposed  constancy  of  the  j's  and 
of  the  ratios  of  the  y's  will  make  the  ratio  qx/pi  constant. 
The  last  integral  is  evidently  to  be  taken  between  the  limits 
zero  and  the  value  of  px  in  the  phase  originally  considered, 
and  the  quantities  before  the  integral  sign  may  be  taken  as 
relating  to  that  phase.    We  have  therefore 

Jfx  dqx  =  iPl  ix  =  \px  |5- .  (132) 

That  is:  the  several  parts  into  which  the  kinetic  energy  is 
divided  in  equation  (128)  represent  the  amounts  of  energy 
communicated  to  the  system  by  the  several  forces  Fx, .  . .  Fn 
under  the  conditions  mentioned. 

The  following  transformation  will  not  only  give  the  value 
of  the  average  kinetic  energy,  but  will  also  serve  to  separate 
the  distribution  of  the  ensemble  in  configuration  from  its  dis- 
tribution in  velocity. 

Since  2  e9  is  a  homogeneous  quadratic  function  of  the  p% 
which  is  incapable  of  a  negative  value,  it  can  always  be  ex- 
pressed (and  in  more  than  one  way)  as  a  sum  of  squares  of 
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linear  functions  of  the  p's*  The  coefficients  in  these  linear 
functions,  like  those  in  the  quadratic  function,  must  be  regarded 
in  the  general  case  as  functions  of  the  ;'s.    Let 

2«p  =  t41«  +  «1«. ..  +  *.*  (133) 

where  Uj . . .  u*  are  such  linear  functions  of  the  jo's.  If  we 
write 

d(px  --.pn) 

d(th  . . .  tO 

for  the  Jacobian  or  determinant  of  the  differential  coefficients 
of  the  form  dpjdu,  we  may  substitute 

din,...  un)  dui-"du* 

for  dpl . . .  dpn 

under  the  multiple  integral  sign  in  any  of  our  formulae.  It 
will  be  observed  that  this  determinant  is  function  of  the  g's 
alonei  The  sign  of  such  a  determinant  depends  on  the  rela- 
tive order  of  the  variables  in  the  numerator  and  denominator. 
But  since  the  suffixes  of  the  tt's  are  only  used  to  distinguish 
these  functions  from  one  another,  and  no  especial  relation  is 
supposed  between  a  p  and  a  u  which  have  the  same  suffix,  we 
may  evidently,  without  loss  of  generality,  suppose  the  suffixes 
so  applied  that  the  determinant  is  positive. 

Since  the  tt's  are  linear  functions  of  the  p%  when  the  in- 
tegrations are  to  cover  all  values  of  the  p7s  (for  constant  j's) 
once  and  only  once,  they  must  cover  all  values  of  the  u's  once 
and  only  once,  and  the  limits  will  be  ±  oo  f  or  all  the  u's. 
Without  the  supposition  of  the  last  paragraph  the  upper  limits 
would  not  always  be  +  oo ,  as  is  evident  on  considering  the 
effect  Of  changing  the  sign  of  a  u.  But  with  the  supposition 
which  we  have  made  (that  the  determinant  is  always  positive) 
we  may  make  the  upper  limits  +  oo  and  the  lower  —  oo  for  all 
the  tt's.  Analogous  considerations  will  apply  where  the  in- 
tegrations do  not  cover  all  values  of  the  py8  and  therefore  of 

*  The  reduction  requires  only  the  repeated  application  of  the  process  of 
'completing  the  square'  used  in  the  solution  of  quadratic  equations. 
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the  tt's.    The  integrals  may  always  be  taken  from  a  less  to  a 
greater  value  of  a  u. 

The  general  integral  which  expresses  the  fractional  part  of 
the  ensemble  which  falls  within  any  given  limits  of  phase  is 
thus  reduced  to  the  form 

For  the  average  value  of  the  part  of  the  kinetic  energy 
which  is  represented  by  £  u^  whether  the  average  is  taken 
for  the  whole  ensemble,  or  for  a  given  configuration,  we  have 
therefore 

|E,      /Sg^jfafl.      g,  (135) 

*l  ^«_"J*  (2*©)*  ~2'  V      ' 

+  e  ^du, 


J    — OB 


and  for  the  average  of  the  whole  kinetic  energy,  Jit©,  as 
before. 

The  fractional  part  of  the  ensemble  which  lies  within  any 
given  limits  of  configuration,  is  found  by  integrating  (134) 
with  respect  to  the  m's  from  —  oo  to  +  oo .    This  gives 

(2.®)*/. .  ./«vffi ;;;$*, . • .  *    w 

which  shows  that  the  value  of  the  Jacobian  is  independent  of 
the  manner  in  which  2ep  is  divided  into  a  sum  of  squares. 
We  may  verify  this  directly,  and  at  the  same  time  obtain  a 
more  convenient  expression  for  the  Jacobian,  as  follows. 

It  will  be  observed  that  since  the  it's  are  linear  functions  of 
the  />'s,  and  the  ^'s  linear  functions  of  the  y's,  the  u's  will  be 
linear  functions  of  the  q%  so  that  a  differential  coefficient  of 
the  form  du/dq  will  be  independent  of  the  q%  and  function  of 
the  ?'s  alone.  Let  us  write  dps/duw  for  the  general  element 
of  the  Jacobian  determinant.    We  have 
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dp9 d_  d<  d   r^dt    dur 

dv^      du9  dqm      du9  r=i  dur  dqm 

_  *=*  /     <Pc     cfaA  _   d    dc  _du, 


Therefore 


<*(u,...S)     <*(?,...?„)  (138> 


and 


\d(u,  ...un)J         \dQ,  . . .  q^J        dfa  ...?,) 


(139) 


These  determinants  are  all  functions  of  the  y's  alone.*  The 
last  is  evidently  the  Hessian  or  determinant  formed  of  the 
second  differential  coefficients  of  the  kinetic  energy  with  re- 
spect to  y, , . . .  j„.  We  shall  denote  it  by  Aj.  The  reciprocal 
determinant 

which  is  the  Hessian  of  the  kinetic  energy  regarded  as  func- 
tion of  the  p%  we  shall  denote  by  Ap. 
If  we  set 


-ft?  *°°        t°°    -JL        1 


■/-/ 


28        dtf!  .  .  .  du*  =  (2ire)2,         (140) 


and  4,9  =  f-+„  (141) 

*  It  wiU  be  observed  that  the  proof  of  (187)  depends  on  the  linear  relation 

between  the  «'s  and  q's,  which  makes  —r  constant  with  respect  to  the  differ- 

dqx 
entiationi  here  considered.    Compare  note  on  p.  12. 
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the  fractional  part  of  the  ensemble  which  lies  within  any 
given  limits  bf  configuration  (136)  may  be  written 

e    e    Aj  dqx . . .  dq„  (142) 


/•••/■ 


where  the  constant  ^q  may  be  determined  by  the  condition 
that  the  integral  extended  over  all  configurations  has  the  value 
unity,* 

*  In  the  simple  bat  important  case  in  which  Aj  is  independent  of  the  o^s, 
and  if  a  quadratic  function  of  the  q**,  if  we  write  *>  for  the  least  value  of  cff 
(or  of  c)  consistent  with  the  given  values  of  the  external  coordinates,  the 
equation  determining  *f  may  be  written 

e    #     =  A*y*. .  .J  €         *      dqi...  dqn. 

If  we  denote  by  q\% ...  q*  the  values  of  ql9 . . .  ?»  which  give  «,  its  least  value 
€at  it  is  evident  that  c«  —  ta  is  a  homogenous  quadratic  function  of  the  differ- 
ences 9!  —  ft',  etc,  and  that  dqlt...dqH  may  be  regarded  as  the  differentials 
of  these  differences.  The  evaluation  of  this  integral  is  therefore  analytically 
similar  to  that  of  the  integral 

f.  ..ft  rfft  .  .  .  dpn, 

for  which  we  have  found  the  value  4p  (2»e)w.  By  the  same  method,  or 
by  analogy,  we  get 


where  A^  is  the  Hessian  of  the  potential  energy  as  function  of  the  (fn.  It 
will  be  observed  that  A?  depends  on  the  forces  of  the  system  and  is  independ- 
ent of  the  masses,  while  A£  or  its  reciprocal  Ap  depends  on  the  masses  and 
is  independent  of  the  forces.  While  each  Hessian  depends  on  the  system  of 
coordinates  employed,  the  ratio  A^/Aj  is  the  same  for  all  systems. 
Multiplying  the  last  equation  by  (140),  we  have 


•<&< 


(2iref. 
For  the  average  value  of  the  potential  energy,  we  have 

y...y  («t-cj«       •    dqi...dqn 


«*-€«  = 


#     dql...dqm 


/•••/• 
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When  an  ensemble  of  systems  is  distributed  in  confenrm- 
Uon  in  the  manner  indicated  in  this  formula,  £  *.,  when  its 
distribution  in  configuration  is  the  same  as  that  of  an  en- 
semble canonically  distributed  in  phase,  we  shall  say,  without 
any  reference  to  its  velocities,  that  it  is  etmomeattg  distributed 
in  confyuration. 

For  any  given  configuration,  the  fractional  part  of  the 
systems  which  lie  within  any  given  limits  of  velocity  is 
represented  by  the  quotient  erf  the  multiple  integral 

or  its  equivalent 


/-/• 


*e       A***H. ..***, 


taken  within  those  limits  divided  by  the  value  of  the  same 
integral  for  the  limits  ±  oo.  But  the  value  of  the  second 
multiple  integral  for  the  limits  ±  oo  is  evidently 

We  may  therefore  write 

e  °    d^.^du.,  (143) 


/•••/• 


The  evaluation  of  this  expression  is  similar  to  that  of 

-H>     -H>    -i 
J . .  .J+*      #  dpx^.dpm 


/.../a     %dPl...dPm 

which  expresses  the  average  value  of  the  kinetic  energy,  and  for  which  we 
have  found  the  value  sue.    We  have  accordingly 

«i-««  =  g*e. 

Adding  the  equation 

we  hare  t  — t«  =  nei 
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or 


/•••/«   e    ±Pidpl...dpn9  (144) 


♦ir-*) 


or  again 


/.../•  •    **&...$*,  (146) 

for  the  fractional  part  of  the  systems  of  any  given  configurer 
tion  which  lie  within  given  limits  of  velocity. 

When  systems  are  distributed  in  velocity  according  to  these 
formulae,  i.  e.,  when  the  distribution  in  velocity  is  like  that  in 
an  ensemble  which  is  canonically  distributed  in  phase,  we 
shall  say  that  they  are  canonically  distributed  in  velocity. 

The  fractional  part  of  the  whole  ensemble  which  falls 
within  any  given  limits  of  phase,  which  we  have  before 
expressed  in  the  form 

izi 

e*  dfr...  dpndqx .  . .  dqn9  (146) 


/•••/ 


may  also  be  expressed  in  the  form 

e  e  ^dqx  . . .  dqmdq1  . .  .  dq„  (147) 


/•••/• 
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CHAPTER  VI. 

EXTENSION  IN  CONFIGURATION  AND  EXTENSION 
IN  VELOCITY. 

The  formulae  relating  to  canonical  ensembles  in  the  closing 
paragraphs  of  the  last  chapter  suggest  certain  general  notions 
and  principles,  which  we  shall  consider  in  this  chapter,  and 
which  are  not  at  all  limited  in  their  application  to  the  canon- 
ical law  of  distribution.* 

We  have  seen  in  Chapter  IV.  that  the  nature  of  the  distribu- 
tion which  we  have  called  canonical  is  independent  of  the 
system  of  coordinates  by  which  it  is  described,  being  deter- 
mined entirely  by  the  modulus.  It  follows  that  the  value 
represented  by  the  multiple  integral  (142),  which  is  the  frac- 
tional part  of  the  ensemble  which  lies  within  certain  limiting 
configurations,  is  independent  of  the  system  of  coordinates, 
being  determined  entirely  by  the  limiting  configurations  with 
the  modulus.  Now  yjry  as  we  have  already  seen,  represents 
a  value  which  is  independent  of  the  system  of  coordinates 
by  which  it  is  defined.  The  same  is  evidently  true  of 
Vv  by  equation  (140),  and  therefore,  by  (141),  of  ^q. 
Hence  the  exponential  factor  in  the  multiple  integral  (142) 
represents  a  value  which  is  independent  of  the  system  of 
coordinates.  It  follows  that  the  value  of  a  multiple  integral 
of  the  form 

j.  .  .f±*dqx  ...dqn  (148) 

*  These  notions  and  principles  are  in  fact  such  as  a  more  logical  arrange- 
ment of  the  subject  would  place  in  connection  with  those  of  Chapter  I.,  to 
which  they  are  closely  related.  The  strict  requirements  of  logical  order 
have  been  sacrificed  to  the  natural  development  of  the  subject,  and  very 
elementary  notions  have  been  left  until  they  have  presented  themselves  in 
the  study  of  the  leading  problems. 
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is  independent  of  the  system  of  coordinates  which  is  employed 
for  its  evaluation,  as  will  appear  at  once,  if  we  suppose  the 
multiple  integral  to  be  broken  up  into  parts  so  small  that 
the  exponential  factor  may  be  regarded  as  constant  in  each. 
In  the  same  way  the  formulae  (144)  and  (145)  which  express 
the  probability  that  a  system  (in  a  canonical  ensemble)  of  given 
configuration  will  fall  within  certain  limits  of  velocity,  show 
that  multiple  integrals  of  the  form 

f...f±pidpi...dpn  (149) 

o*  /•••/*;*«&...«&  (150) 

relating  to  velocities  possible  for  a  given  configuration,  when 
the  limits  are  formed  by  given  velocities,  have  values  inde- 
pendent of  the  system  of  coordinates  employed. 

These  relations  may  easily  be  verified  directly.    It  has  al- 
ready been  proved  that 

d(Pl9...PJ  =  d(ql...qJ     _d(qu...qn) 
d(j>i,  •  •  -i>0       d(Qu  . . .  &)      d(Qu  ...Qm) 

where  qx , . . .  q^Pi  >  •  •  •  jft  and  Qx , . . .  Qu9  Px , . . .  Pn  are  two 
systems  of  coordinates  and  momenta.*    It  follows  that 

"J  "'J  W»..*J  U^oJ  \M^)  dQl"dQ- 

•  See  equation  (29). 
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and 

-J-v  www;  U-J  W^-oJ  **"*• 

The  multiple  integral 

f . .  .J dfr  . . .  dpudqt  ...dqu9  (151) 

which  may  also  be  written 

J. .  A  L^dqx  . .  .  dqndqx  .  . .  dqn,  (152) 

and  which,  when  taken  within  any  given  limits  of  phase,  has 
been  shown  to  have  a  value  independent  of  the  coordinates 
employed,  expresses  what  we  have  called  an  exten*ion~in- 
pha$e.m  In  like  manner  we  may  say  that  the  multiple  integral 
(148)  expresses  an  extenrim^n-^xmfiguration,  and  that  the 
multiple  integrals  (149)  and  (150)  express  an  extention+n- 
velocity.    We  have  called 

dpi  . . .  dpndqx  ...dqnf  (153) 

which  is  equivalent  to 

*;<&  .  . .  dqmdqx  ...dq%f  (154) 

an  element  of  extension-in-phase.    We  may  call 

^dq1...dqu  (IBS) 

an  element  of  extensioir-in-configuration,  and 

t^dfr  ...dp„  (156) 

•  See  Chapter  I,  p.  NX 
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or  its  equivalent 

±}dqi  •  •  •  <&>  (157) 

an  element  of  extension-in-velocity. 

An  extension-in-phase  may  always  be  regarded  as  an  integral 
of  elementary  extensions-in-configuration  multiplied  each  by 
an  extension-in-velocity.  This  is  evident  from  the  formulae 
(151)  and  (152)  which  express  an  extension-in-phase,  if  we 
imagine  the  integrations  relative  to  velocity  to  be  first  carried 
out. 

The  product  of  the  two  expressions  for  an  element  of 
extension-in-velocity  (149)  and  (150)  is  evidently  of  the  same 
dimensions  as  the  product 

Pi  •  •  'Pn^l  •  ••£» 

that  is,  as  the  nth  power  of  energy,  since  every  product  of  the 
form  px  qx  has  the  dimensions  of  energy.  Therefore  an  exten- 
sion-in-velocity has  the  dimensions  of  the  square  root  of  the 
nth  power  of  energy.  Again  we  see  by  (155)  and  (156)  that 
the  product  of  an  extension-in-configuration  and  an  extension- 
in-velocity  have  the  dimensions  of  the  nth  power  of  energy 
multiplied  by  the  nth  power  of  time.  Therefore  an  extension- 
in-configuration  has  the  dimensions  of  the  nth  power  of  time 
multiplied  by  the  square  root  of  the  nth  power  of  energy. 

To  the  notion  of  extension-in-configuration  there  attach 
themselves  certain  other  notions  analogous  to  those  which  have 
presented  themselves  in  connection  with  the  notion  of  ex- 
tension-in-phase. The  number  of  systems  of  any  ensemble 
(whether  distributed  canonically  or  in  any  other  manner) 
which  are  contained  in  an  element  of  extension-in-configura- 
tion, divided  by  the  numerical  value  of  that  element,  may  be 
called  the  dentity-irircanfiguration.  That  is,  if  a  certain  con- 
figuration is  specified  by  the  coordinates  ^...j,,  and  the 
number  of  systems  of  which  the  coordinates  fall  between  the 
limits  q1  and  qx  +  dqx , . . .  qn  and  q%  +  dqn  is  expressed  by 

D§±*dqx...dq„  (158) 
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Dq  will  be  the  denflity-in-confignration.    And  if  we  set 

*  =  3f>  (159) 

where  N  denotes,  as  usual,  the  total  number  of  systems  in  the 
ensemble,  the  probability  that  an  unspecified  system  of  the 
ensemble  will  fall  within  the  given  limits  of  configuration,  is 
expressed  by 

e^fifdqt  ...dqu.  (160) 

We  may  call  e**  the  coefficient  of  probability  of  the  configura- 
tion, and  t)q  the  index  of  probability  of  the  configuration. 

The  fractional  part  of  the  whole  number  of  systems  which 
are  within  any  given  limits  of  configuration  will  be  expressed 
by  the  multiple  integral 

e^£^dqx  ...dqu.  (161) 


/•••/ 


The  value  of  this  integral  (taken  within  any  given  configura- 
tions) is  therefore  independent  of  the  system  of  coordinates 
which  is  used.  Since  the  same  has  been  proved  of  the  same 
integral  without  the  factor  e\  it  follows  that  the  values  of 
rfq  and  Dq  for  a  given  configuration  in  a  given  ensemble  are 
independent  of  the  system  of  coordinates  which  is  used. 

The  notion  of  extension-in-velocity  relates  to  systems  hav- 
ing the  same  configuration.*  If  an  ensemble  is  distributed 
both  in  configuration  and  in  velocity,  we  may  confine  our 
attention  to  those  systems  which  are  contained  within  certain 
infinitesimal  limits  of  configuration,  and  compare  the  whole 
number  of  such  systems  with  those  which  are  also  contained 

*  Except  in  some  simple  cases,  such  as  a  system  of  material  points,  we 
cannot  compare  velocities  in  one  configuration  with  velocities  in  another,  and 
speak  of  their  identity  or  difference  except  in  a  sense  entirely  artificial  We 
may  indeed  say  that  we  call  the  velocities  in  one  configuration  the  same  as 
those  in  another  when  the  quantities  qlt . . .  gn  have  the  same  values  in  the 
two  cases.  But  this  signifies  nothing  until  the  system  of  coordinates  has 
been  defined.  We  might  identify  the  velocities  in  the  two  cases  which  make 
the  quantities  pu  . . .  pn  the  same  in  each.  This  again  would  signify  nothing 
independently  of  the  system  of  coordinates  employed. 
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within  certain  infinitesimal  limits  of  velocity.  The  second 
of  these  numbers  divided  by  the  first  expresses  the  probability 
that  a  system  which  is  only  specified  as  falling  within  the  in- 
finitesimal limits  of  configuration  shall  also  fall  within  the 
infinitesimal  limits  of  velocity.  If  the  limits  with  respect  to 
velocity  are  expressed  by  the  condition  that  the  momenta 
shall  fall  between  the  limits  pl  and  px  +  dpt, . .  .pn  and 
P%  +  dpm  the  extension-in-velocity  within  those  limits  will  be 

±P*Pl  •  •  •  dPn* 

and  we  may  express  the  probability  in  question  by 

f**fdpx . . .  dpn.  (162) 

This  may  be  regarded  as  defining  r)p. 

The  probability  that  a  system  which  is  only  specified  as 
having  a  configuration  within  certain  infinitesimal  limits  shall 
also  fall  within  any  given  limits  of  velocity  will  be  expressed 
by  the  multiple  integral 

J. .  .ff>*fdpx . . .  dPnf  (163) 

or  its  equivalent 

taken  within  the  given  limits. 

It  follows  that  the  probability  that  the  system  will  fall 
within  the  limits  of  velocity,  qt  and  $,  +  dq1$ . . .  qn  and 
in  +  dft*  is  expressed  by 

e***dqx...din.  (166) 

The  value  of  the  integrals  (163),  (164)  is  independent  of 
the  system  of  coordinates  and  momenta  which  is  used,  as  is 
also  the  value  of  the  same  integrals  without  the  factor 
e1*;  therefore  the  value  of  rjp  must  be  independent  of  the 
system  of  coordinates  and  momenta.  We  may  call  e*9  the 
coefficient  of  probability  of  velocity,  and  i;p  the  index  of  proba- 
bility of  velocity. 
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Comparing  (160)  and  (162)  with  (40),  we  get 

«V>  =  P  =  «*  (166) 

OT  *  +  *  =  *  (Mf) 

That  is :  the  product  of  the  coefficients  of  probability  of  con- 
figuration and  of  velocity  is  equal  to  the  coefficient  of  proba- 
bility of  phase;  the  sum  of  the  indices  of  probability  of 
configuration  and  of  velocity  is  equal  to  the  index  of 
probability  of  phase. 

It  is  evident  that  e**  and  •*  have  the  dimensions  of  the 
reciprocals  of  extension-in-configuration  and  extension-in- 
velocity  respectively,  i.  e*,  the  dimensions  of  t~*  c~»  and  r-*, 
where  t  represent  any  time,  and  e  any  energy.  If,  therefore, 
the  unit  of  time  is  multiplied  by  e„  and  the  unit  of  energy  by 
c«,  every  i|f  will  be  increased  by  the  addition  of 

n  log  c,  +  in  log  e„  (168) 

and  every  rjp  by  the  addition  of 

in  logo.*  (169) 

It  should  be  observed  that  the  quantities  which  have  been 
called  extension-in-configuration  and  cxtension-in-velocity  are 
not,  as  the  terms  might  seem  to  imply,  purely  geometrical  or 
kinematical  conceptions.  To  express  their  nature  more  fully, 
they  might  appropriately  have  been  called,  respectively,  the 
dynamical  measure  of  the  extension  in  configuration,  and  the 
dynamical  measure  of  the  extension  in  velocity.  They  depend 
upon  the  masses,  although  not  upon  the  forces  of  the 
system.  In  the  simple  case  of  material  points,  where  each 
point  is  limited  to  a  given  space,  the  extension-in-configuration 
is  the  product  of  the  volumes  within  which  the  several  points 
are  confined  (these  may  be  the  same  or  different),  multiplied 
by  the  square  root  of  the  cube  of  the  product  of  the  masses  of 
the  several  points.  The  extension-in- velocity  for  such  systems 
is  most  easily  defined  as  the  extension-in-configuration  of 
systems  which  have  moved  from  the  same  configuration  for 
the  unit  of  time  with  the  given  velocities. 
•  Compare  (47)  in  Chapter  I. 
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In  the  general  case,  the  notions  of  eztension-in-configuration 
and  extension-in-velocity  may  be  connected  as  follows. 

If  an  ensemble  of  similar  systems  of  n  degrees  of  freedom 
have  the  same  configuration  at  a  given  instant,  but  are  distrib- 
uted throughout  any  finite  extension-in-velocity,  the  same 
ensemble  after  an  infinitesimal  interval  of  time  St  will  be 
distributed  throughout  an  extension  in  configuration  equal  to 
its  original  extension-in-velocity  multiplied  by  &*. 

In  demonstrating  this  theorem,  we  shall  write  ;/, . . .  qj  for 
the  initial  values  of  the  coordinates.  The  final  values  will 
evidently  be  connected  with  the  initial  by  the  equations 

fc-fc'  =  ft*.«  •?»-?»'  =  fc*  (170) 

Now  the  original  extension-in-velocity  is  by  definition  repre- 
sented by  the  integral 

f...f^di1...dqn9  (171) 

where  the  limits  may  be  expressed  by  an  equation  of  the  form 
*<i,...iO  =  fc  (172) 

The  same  integral  multiplied  by  the  constant  &*  may  be 
written 

f. .  . JV*&*). . . .  d&fc),  (173) 

and  the  limits  may  be  written 

F  tii  •  •  •  i)  =  /  (ft  «,  . . .  in  SO  =  0.  (174) 

(It  will  be  observed  that  St  as  well  as  Ag  is  constant  in  the 
integrations.)    Now  this  integral  is  identically  equal  to 

J. .  .J  Af*d(?i  -  fr')  . . .  d(qm  . . .  qj),  (175) 

or  its  equivalent 

f--f*ldql...dqn,  (176) 

with  limits  expressed  by  the  equation 

yfei-^...^-^)=0.  (177) 
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But  the  systems  which  initially  had  velocities  satisfying  the 
equation  (172)  will  after  the  interval  St  have  configurations 
satisfying  equation  (177).  Therefore  the  extension-in-con- 
figuration  represented  by  the  last  integral  is  that  which 
belongs  to  the  systems  which  originally  had  the  extension-in- 
velocity  represented  by  the  integral  (171). 

Since  the  quantities  which  we  have  called  extensions-bi- 
phase, extensions-in-configuiation9  and  extensions-in-velocity 
are  independent  of  the  nature  of  the  system  of  coordinates 
used  in  their  definitions,  it  is  natural  to  seek  definitions  which 
shall  be  independent  of  the  use  of  any  coordinates.  It  will  be 
sufficient  to  give  the  following  definitions  without  formal  proof 
of  their  equivalence  with  those  given  above,  since  they  are 
less  convenient  for  use  than  those  founded  on  systems  of  co- 
ordinates, and  since  we  shall  in  fact  have  no  occasion  to  use 
them. 

We  commence  with  the  definition  of  extension-in-velocity. 
We  may  imagine  n  independent  velocities,  Vv . . .  Vn  of  which  a 
system  in  a  given  configuration  is  capable.  We  may  conceive 
of  the  system  as  having  a  certain  velocity  V0  combined  with  a 
part  of  each  of  these  velocities  Vx . . .  Vu.  By  a  part  of  Vx  is 
meant  a  velocity  of  the  same  nature  as  Vx  but  in  amount  being 
anything  between  zero  and  Vv  Now  all  the  velocities  which 
may  be  thus  described  may  be  regarded  as  forming  or  lying  in 
a  certain  extension  of  which  we  desire  a  measure.  The  case 
is  greatly  simplified  if  we  suppose  that  certain  relations  exist 
between  the  velocities  V\ , . . .  Vny  viz :  that  the  kinetic  energy 
due  to  any  two  of  these  velocities  combined  is  the  sum  of  tie 
kinetic  energies  due  to  the  velocities  separately.  In  this  case 
the  extension-in-motion  is  the  square  root  of  the  product  of 
the  doubled  kinetic  energies  due  to  the  n  velocities  Vl9...Vw 
taken  separately. 

The  more  general  case  may  be  reduced  to  this  simpler  case 
as  follows.  The  velocity  Fa  may  always  be  regarded  as 
composed  of  two  velocities  Vj  and  VJ\  of  which  VJ  is  of 
the  same  nature  as  Vt ,  (it  may  be  more  or  less  in  amount,  or 
opposite  in  sign,)  while   VJ'  satisfies  the  relation  that  the 


Digitized  by  VjOOQIC 


66  EXTENSION  IN  CONFIGURATION 

kinetic  energy  due  to  Vx  and  V2,f  combined  is  the  Bum  of  the 
kinetic  energies  due  to  these  velocities  taken  separately.  And 
the  velocity  Vz  may  be  regarded  as  compounded  of  three, 
V{%  y%%  y%\  of  which  r8'  is  of  the  same  nature  as  Vx%  Vs" 
of  the  same  nature  as  V%'\  while  V9'"  satisfies  the  relations 
that  if  combined  either  with  Vx  or  V%"  the  kinetic  energy  of 
the  combined  velocities  is  the  sum  of  the  kinetic  energies  of 
the  velocities  taken  separately.  When  all  the  velocities 
V% , . . .  Vn  have  been  thus  decomposed,  the  square  root  of  the 
product  of  the  doubled  kinetic  energies  of  the  several  velocities 
Vi>  V%\  V{"%  ©to.,  will  be  the  value  of  the  extension-in- 
velocity  which  is  sought. 

This  method  of  evaluation  of  the  extension-in-velocity  which 
we  are  considering  is  perhaps  the  most  simple  and  natural,  but 
the  result  may  be  expressed  in  a  more  symmetrical  form.  Let 
us  write  e1%  for  the  kinetic  energy  of  the  velocities  Vx  and  V% 
combined,  diminished  by  the  sum  of  the  kinetic  energies  due 
to  the  same  velocities  taken  separately.  This  may  be  called 
the  mutual  energy  of  the  velocities  Vx  and  V%.  Let  the 
mutual  energy  of  every  pair  of  the  velocities  Vx , . . .  Vn  be 
expressed  in  the  same  way.  Analogy  would  make  en  represent 
the  energy  of  twice  Vx  diminished  by  twice  the  energy  of  Vx , 
t.  e.,  €n  would  represent  twice  the  energy  of  Vx ,  although  the 
term  mutual  energy  is  hardly  appropriate  to  this  case-  At  all 
events,  let  exx  have  this  signification,  and  eM  represent  twice 
the  energy  of  V%9  etc.    The  square  root  of  the  determinant 

*U  €M  .  .  .  €1* 
*tt  €«  •  •  •  €* 


represents  the  value  of  the  extension-in-velocity  determined  as 
above  described  by  the  velocities  Vx , . . .  V%. 

The  statements  of  the  preceding  paragraph  may  be  readily 
proved  from  the  expression  (157)  on  page  60,  viz., 

t£dqx .  . .  dq% 

by  which  the  notion  of  an  element  of  extension-in-velocity  was 
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originally  defined.  Since  Jj  in  this  expression  represents 
the  determinant  of  which  the  general  element  is 

the  square  of  the  preceding  expression  Represents  the  determi- 
nant of  which  the  general  element  is 

Now  we  may  regard  the  differentials  of  velocity  dq*  dqj  as 
themselves  infinitesimal  velocities.  Then  the  last  expression 
represents  the  mutual  energy  of  these  velocities,  and 

d*€     . 

represents  twice  the  energy  due  to  the  velocity  dq4. 

The  case  which  we  have  considered  is  an  extension-in-veloc- 
ity  of  the  simplest  form.  All  extensions-in-velocity  do  not 
have  this  form,  bat  all  may  be  regarded  as  composed  of 
elementary  extensions  of  this  form,  in  the  same  manner  as 
all  volumes  may  be  regarded  as  composed  of  elementary 
parallelepipeds. 

Having  thus  a  measure  of  extension-in-velocity  founded,  it 
will  be  observed,  on  the  dynamical  notion  of  kinetic  energy, 
and  not  involving  an  explicit  mention  of  coordinates,  we  may 
derive  from  it  a  measure  of  extension-in-configuration  by  the 
principle  connecting  these  quantities  which  has  been  given  in 
a  preceding  paragraph  of  this  chapter. 

The  measure  of  extension-in-phase  may  be  obtained  from 
that  of  extension-in-configuration  and  of  extension-in-velocity. 
For  to  every  configuration  in  an  extension-in-phase  there  will 
belong  a  certain  extension-in-velocity,  and  the  integral  of  the 
elements  of  extension-in-configuration  within  any  extension- 
in-phase  multiplied  each  by  its  extension-in-velocity  is  the 
measure  of  the  extension-in-phase. 
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CHAPTER  VH. 

FARTHER   DISCUSSION    OF  AVERAGES  IN  A  CANONICAL 
ENSEMBLE  OF  SYSTEMS. 

Returning  to  the  case  of  a  canonical  distribution,  we  have 
for  the  index  of  probability  of  configuration 

*»*£*  (178) 

as  appears  on  comparison  of  formulae  (142)  and  (161).  It 
follows  immediately  from  (142)  that  the  average  value  in  the 
ensemble  of  any  quantity  u  which  depends  on  the  configura- 
tion alone  is  given  by  the  formula 

r  *  r    fcl? 
«=*J...J«<    e    ^dqx...dqn9  (179) 


where  the  integrations  cover  all  possible  configurations.    The 
value  of  yftg  is  evidently  determined  by  the  equation 

_&      r  *  r  _S 

e   e  = J  . .  J  e  %*<*?! . . .  dqn.  (180) 

OOlmf. 

By  differentiating  the  last  equation  we  may  obtain  results 
analogous  to  those  obtained  in  Chapter  IV  from  the  equation 

-  e 


phuet 


As  the  process  is  identical,  it  is  sufficient  to  give  the  results : 
<fyt  =  ^de  -  Ix dax  -  J^dat  -  etc,  (181) 


Digitized  by  VjOOQIC 


AVERAGES  IN  A   CANONICAL  ENSEMBLE.  69 

or,  since  ^f  =  ^  +  e^ ,  (182) 

and  <tyf  =  <fi;  +  v»  +  ed}f,  (183) 

d^  =  -  e<*Jf  -  Jx<fth  -  Jtrfo,  -  etc  (184) 

It  appears  from  this  equation  that  the  differential  relations 
subsisting  between  the  average  potential  energy  in  an  ensem- 
ble of  systems  canonically  distributed,  the  modulus  of  distri- 
bution, the  average  index  of  probability  of  configuration,  taken 
negatively,  and  the  average  forces  exerted  on  external  bodies, 
are  equivalent  to  those  enunciated  by  Clausius  for  the  potential 
energy  of  a  body,  its  temperature,  a  quantity  which  he  called 
the  disgregation,  and  the  forces  exerted  on  external  bodies.* 

For  the  index  of  probability  of  velocity,  in  the  case  of  ca- 
nonical distribution,  we  have  by  comparison  of  (144)  and  (168), 
or  of  (145)  and  (164), 

^^p  (186) 

which  gives  fo  =   f  -  *f;  (186) 

we  have  also  s  =  in®>  (187) 

and  by  (140),  +p  =  -  i  n  0  log  (2*-3).  (188) 

From  these  equations  we  get  by  differentiation 

#,  =  *<»,  (189) 

and  «^  =  -e^  (190) 

The  differential  relation  expressed  in  this  equation  between 
the  average  kinetic  energy,  the  modulus,  and  the  average  index 
of  probability  of  velocity,  taken  negatively,  is  identical  with 
that  given  by  Clausius  loci*  citatu  for  the  kinetic  energy  of  a 
body,  the  temperature,  and  a  quantity  which  he  called  the 
transformation-value  of  the  kinetic  energy,  f    The  relations 

«  =  <i  +  V        V  =  v<  +  Vp 

•  Pogg.  Ann.,  Bd.  CXVI,  S.  78,  (1802) ;  ibid.,  Bd.  CXXV,  S.  858,  (1865). 
See  alio  Boltzmann,  Sitzb.  der  Wiener  Akad.,  Bd.  LXTTT,  8.  728,  (1871). 
t  Yerwandlungtwerth  dee  Wftrmeinhaltee. 
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are  also  identical  with  those  given  by  Clausius  for  the  corre- 
sponding quantities. 

Equations  (112)  and  (181)  dhow  that  if  ^r  or  ^q  is  known 
as  function  of  6  and  a19  a,,  etc.,  we  can  obtain  by  differentia- 
tion e  or  €g,  and  Al9  A%y  etc.  as  functions  of  the  same  varia- 
bles.   We  have  in  fact 

1  =  ^-0^=^-0  3§-  (191) 

The  corresponding  equation  relating  to  kinetic  energy, 

which  may  be  obtained  in  the  same  way,  may  be  verified  by 
the  known  relations  (186),  (187),  and  (188)  between  the 
variables.    We  have  also 

*— £--$         <im> 

etc.,  so  that  the  average  values  of  the  external  forces  may  be 
derived  alike  from  ^r  or  from  ^r 

The  average  values  of  the  squares  or  higher  powers  of  the 
energies  (total,  potential,  or  kinetic)  may  easily  be  obtained  by 
repeated  differentiations  of  ^r,  yfrqy  yfrpy  or  i,  eq9  ep,  with 
respect  to  6.    By  equation  (108)  we  have 


an 

pbttM 


t«  e  dfr  . . .  dq„  (196) 


and  differentiating  with  respect  to  ©, 


■-/■•Vc^+sS''*'*-*.  « 

phftftM 


whence,  again  by  (108), 


de  __  P  —  ifj      e  <fy 
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Combining  this  with  (191), 

?=7+*£=(*-»£)'-»&    <"•> 

In  precisely  the  same  way,  from  the  equation 

/•n   r    lea 
...JV  e  ±itd9l...dgm9  (199) 

we  may  obtain 

In  the  same  way  also,  if  we  confine  ourselves  to  a  particular 
configuration,  from  the  equation 

...Jef  e    V*i  •••<&»  (201) 

we  obtain 

?-«,+«a-(*-«*)-*9i    » 

which  by  (187)  reduces  to 

?=(*n»+*n)e».  (203) 

Since  this  value  is  independent  of  the  configuration,  we  see 
that  the  average  square  of  the  kinetic  energy  for  every  configu- 
ration is  the  same,  and  therefore  the  same  as  for  the  whole 
ensemble.  Hence  e?  may  be  interpreted  as  the  average  either 
for  any  particular  configuration,  or  for  the  whole  ensemble. 
It  will  be  observed  that  the  value  of  this  quantity  is  deter- 
mined entirely  by  the  modulus  and  the  number  of  degrees  of 
freedom  of  the  system,  and  is  in  other  respects  independent  of 
the  nature  of  the  system. 

Of  especial  importance  are  the  anomalies  of  the  energies,  or 
their  deviations  from  their  average  values.    The  average  value 
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of  these  anomalies  is  of  coarse  zero.    The  natural  measure  of 
such  anomalies  is  the  square  root  of  their  average  square.    Now 

(c-c)a=:?-r,  (204) 

identically.    Accordingly 

In  like  manner, 
Hence 


(207) 


(c  -  «)■  =  (c,  -  y  +  (^  -  V-  (208) 

Equation  (206)  shows  that  the  value  of  cGq/dS  can  never  be 
negative,  and  that  the  value  of  cP^/rfO2  or  d«jq/d&  can  never 
be  positive.* 

To  get  an  idea  of  the  order  of  magnitude  of  these  quantities, 
we  may  use  the  average  kinetic  energy  as  a  term  of  comparison, 
this  quantity  being  independent  of  the  arbitrary  constant  in- 
volved in  the  definition  of  the  potential  energy.    Since 

•  In  the  case  discussed  in  the  note  on  page  64,  in  which  the  potential 
energy  is  a  quadratic  function  of  the  q'§,  and  A£  independent  of  the  q%  we 
should  get  for  the  potential  energy 

(fff-tipssgne*, 

and  for  the  total  energy  

(«-i)*  =  ne*. 
We  may  also  write  in  this  case, 
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(209) 

(210) 

(€-e)»_2cC  _2      2di,. 

(211) 

These  equations  show  that  when  the  number  of  degrees  of 
freedom  of  the  systems  is  very  great,  the  mean  squares  of  the 
anomalies  of  the  energies  (total,  potential,  and  kinetic)  are  very 
small  in  comparison  with  the  mean  square  of  the  kinetic 
energy,  unless  indeed  the  differential  coefficient  dejde,  is 
of  the  same  order  of  magnitude  as  n.  Such  values  of  cUJfe, 
can  only  occur  within  intervals  (ej1  —  *pf)  which  are  of  the  or- 
der of  magnitude  of  nr\  unless  it  be  in  cases  in  which  eq  is  in 
general  of  an  order  of  magnitude  higher  than  ep.  Postponing 
for  the  moment  the  consideration  of  such  cases,  it  will  be  in- 
teresting to  examine  more  closely  the  case  of  large  values  of 
deq/ctej,  within  narrow  limits.  Let  us  suppose  that  for  epf  and 
ip"  the  value  of  d€q/dep  is  of  the  order  of  magnitude  of  unity, 
but  between  these  values  of  ep  very  great  values  of  the  differ- 
ential coefficient  occur.  Then  in  the  ensemble  having  modulus 
6"  and  average  energies  ep"  and  ctf",  values  of  eq  sensibly  greater 
than  eg"  will  be  so  rare  that  we  may  call  them  practically  neg- 
ligible. They  will  be  still  more  rare  in  an  ensemble  of  less 
modulus.    For  if  we  differentiate  the  equation 

regarding  et  as  constant,  but  6  and  therefore  yfrQ  as  variable, 
we  get 

whence  by  (192) 

'*^=i^A.  (213) 


(- 


d@Ju~     ©* 
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That  is,  a  diminution  of  the  modulus  will  diminish  the  proba- 
bility of  all  configurations  for  which  the  potential  energy  exceeds 
its  average  value  in  the  ensemble.  Again,  in  the  ensemble 
having  modulus  &  and  average  energies  ej  and  €,',  values  of 
eq  sensibly  less  than  ej  will  be  so  rare  as  to  be  practically  neg- 
ligible. They  will  be  still  more  rare  in  an  ensemble  of  greater 
modulus,  since  by  the  same  equation  an  increase  of  the 
modulus  will  diminish  the  probability  of  configurations  for 
which  the  potential  energy  is  less  than  its  average  value  in 
the  ensemble.  Therefore,  for  values  of  6  between  &  and  ©", 
and  of  €p  between  epf  and  €p",  the  individual  values  of  eq  will 
be  practically  limited  to  the  interval  between  e,'  and  e,". 

In  the  cases  which  remain  to  be  considered,  viz.,  when 
dejdep  has  very  large  values  not  confined  to  narrow  limits, 
and  consequently  the  differences  of  the  mean  potential  ener- 
gies in  ensembles  of  different  moduli  are  in.  general  very  large 
compared  with  the  differences  of  the  mean  kinetic  energies,  it 
appears  by  (210)  that  the  anomalies  of  mean  square  of  poten- 
tial energy,  if  not  small  in  comparison  with  the  mean  kinetic 
energy,  mil  yet  in  general  be  very  small  in  comparison  with 
differences  of  mean  potential  energy  in  ensembles  having 
moderate  differences  of  mean  kinetic  energy,  —  the  exceptions 
being  of  the  same  character  as  described  for  the  case  when 
dig/dej,  is  not  in  general  large. 

It  follows  that  to  human  experience  and  observation  with 
respect  to  such  an  ensemble  as  we  are  considering,  or  with 
respect  to  systems  which  may  be  regarded  as  taken  at  random 
from  such  an  ensemble,  when  the  number  of  degrees  of  free- 
dom is  of  such  order  of  magnitude  as  the  number  of  molecules 
in  the  bodies  subject  to  our  observation  and  experiment,  € — ef 
€,  —  ep,  *q  —  €q  would  be  in  general  vanishing  quantities, 
since  such  experience  would  not  be  wide  enough  to  embrace 
the  more  considerable  divergencies  from  the  mean  values,  and 
such  observation  not  nice  enough  to  distinguish  the  ordinary 
divergencies.  In  other  words,  such  ensembles  would  appear 
to  human  observation  as  ensembles  of  systems  of  uniform 
energy,  and  in  which  the  potential  and  kinetic  energies  (sup- 
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posing  th&t  there  were  means  of  measuring  these  quantities 
separately)  had  each  separately  uniform  values,*  Exceptions 
might  occur  when  for  particular  values  of  the  modulus  the 
differential  coefficient  (UJ<&9  takes  a  very  large  value.  To 
human  observation  the  effect  would  be,  that  in  ensembles  in 
which  6  and  e,  had  certain  critical  values,  <^  would  be  in- 
determinate within  certain  limits,  viz.,  the  values  which  would 
correspond  to  values  of  6  and  e9  slightly  less  and  slightly 
greater  than  the  critical  values.  Such  indeterminateness  cor- 
responds precisely  to  what  we  observe  in  experiments  on  the 
bodies  which  nature  presents  to  us.f 

To  obtain  general  formulae  for  the  average  values  of  powers 
of  the  energies,  we  may  proceed  as  follows.  If  A  is  any  posi- 
tive whole  number,  we  have  identically 

all  •  all  • 

f. .  . JVe~6«fr ...dqn=  *jfef-  •  •  JVV5*i •  •  •  dqm,  (214) 

pfcMM  plMMS 

i.e.,  by  (108), 

-*  4  -*■ 


?«  e  =  «* 


/?=»•  •).  (215) 


Henoe  ?•"• =  (&  ggY«~»t  (216) 

if  if 

and  7  =  e®  f  *gg)  •""*•  (217) 

*  This  implies  that  the  kinetic  and  potential  energies  of  individual  systems 
would  each  separately  hare  values  sensibly  constant  in  time. 

t  As  an  example,  we  may  take  a  system  consisting  of  a  fluid  in  a  cylinder 
under  a  weighted  piston,  with  a  vacuum  between  the  piston  and  the  top  of 
the  cylinder,  which  is  closed.  The  weighted  piston  is  to  be  regarded  as  a 
part  of  the  system.  (This  is  formally  necessary  in  order  to  satisfy  the  con- 
dition of  the  invariability  of  the  external  coordinates.)  It  is  evident  that  at 
a  certain  temperature,  viz.,  when  the  pressure  of  saturated  vapor  balances 
the  weight  of  the  piston,  there  is  an  indeterminateness  in  the  values  of  the 
potential  and  total  energies  as  functions  of  the  temperature. 
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For  h  =  1,  this  gives 

s— *»©         <*8> 

which  agrees  with  (191). 
From  (215)  we  have  also 

In  like  manner  from  the  identical  equation 

oonflg.  oouflg. 

(221) 
♦t/      j\»  _fa. 
to  get  «?  =  6e(®*S§)«    e>  <222> 

and  e7=^  +  «.^1^.  (223) 

With  respect  to  the  kinetic  energy  similar  equations  will 
hold  for  averages  taken  for  any  particular  configuration,  or 
for  the  whole  ensemble.    But  since 

the  equation 

?  =  {h  +  &m)?=t  (224) 

reduces  to 

^-G^S^-JS^**)1"**  (225) 
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We  have  therefore 

9-G+1)?*-  (226) 

?-6+8)(5+1)S*         <w 

^  =    T(in)     *•  <228> 

The  averse  values  of  the  powers  of  the  anomalies  of  the 
energies  are  perhaps  most  easily  found  as  follows.  We  have 
identically,  since  e  is  a  function  of  6,  while  e  is  a  function  of 
the  p'a  and  j's, 

•n  __« 


J...j[e(e-S)»_A(e-^©^J«   •**,..•  <*?.    (229) 


i.  «■.  by  (108), 
^[(^'"•J^^^-M'-i)"®1^]*-5,  (280) 

•  In  the  case  discussed  in  the  note  on  page  64  we  may  easily  get 
(««-««)*  =  (•«-««  +  ^je)  to-*)*-1, 

which,  with  «fl  —  **  =  s  ®» 

gives 

Hence  (•«  -  «*)*  =  •?. 

Again  (•-•«)»=  (;-f.  +  e»^)(t-fa)*-i, 

which  with  «  —  «4  =  n  e 

gives 

(T=7^  =  (ae  +  e«^)  (!  -«.)*-*  =  »(ne  +  e«^)*"V 
hence  (7=7?  =  ^T^f1  <* 
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or  since  by  (218) 


<£ 


In  precisely  the  same  way  we  may  obtain  for  the  potential 
energy 

(«.-^w=«^(««-^  +  A(«.-^-lef;»-  (232) 

By  successive  applications  of  (281)  we  obtain 

(€-5)f  =  I>5 


(l-^rM 


(6-€)*  =  2^€  +  3(Z)€)* 


(6  -.i)«  =  Z>*i  +  10  DcDS 


(e  -  €)•  =  2*i  +  lSZTeZ^i  +  10(2*i)P  +  15(Z)€)»  etc. 

where  D  represents  the  operator  O2  d/dS.  Similar  expres- 
sions relating  to  the  potential  energy  may  be  derived  from 
(282). 

For  the  kinetic  energy  we  may  write  similar  equations  in 
which  the  averages  may  be  taken  either  for  a  single  configura- 
tion or  for  the  whole  ensemble.    But  since 

dep^n 

5®  ""2 

the  general  formula  reduces  to 

fa-*)**  =  e»3g<*-*)»  +  ti.A*fa-*r*    (233) 
or 


fa  -  ^»)w = 2®  <*  fa  -  *)* .  2h  fa  -  6,)*  2&  fa  -i^r1 

(234) 
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But  since  identically 


^  i 

the  value  of  the  corresponding  expression  for  any  index  will 
be  independent  of  6  and  the  formula  reduces  to 

we  have  therefore 


w-*     (h3?-*- 


(¥T4 


etc* 


It  will  be  observed  that  when  ^r  or  e  is  given  as  function  of 
6,  all  averages  of  the  form  ?  or  (e  —  c)*  are  thereby  deter- 

•  In  the  case  discussed  in  {he  preceding  foot-notes  we  get  easily 

(€<r-ig)»=(t,-€p)*, 


For  the  total  energy  we  hare  in  this  case 


(^H*?)' 


\t  — fa'  *  Vf  —  «o'        flVt  — €a' 


(•-•V      »  _, 


Digitized  by  VjOOQIC 


80  AVERAGE  VALUES  IN  A   CANONICAL 

mined.  So  also  if  ^  or  eq  is  given  as  function  of  ©,  aP 
averages  of  the  form  <£*  or  (cq  —  eqy  are  determined.    But 

Therefore  if  any  one  of  the  quantities  yfr>  ^rr  i,  ^  is  known 
as  function  of  6,  and  n  is  also  known,  all  averages  of  any  of 
the  forms  mentioned  are  thereby  determined  as  functions  of 
the  same  variable.    In  any  case  all  averages  of  the  form 


are  known  in  terms  of  n  alone,  and  have  the  same  value 
whether  taken  for  the  whole  ensemble  or  limited  to  any 
particular  configuration. 
If  we  differentiate  the  equation 

all        A   c 

f...fe~*dPl...dqn  =  l  (236) 


with  respect  to  av  and  multiply  by  6,  we  have 

/  •  •/[£  "  £]•"*"*■  ••.«*.«*  (237) 

Differentiating  again,  with  respect  to  av  with  respect  to  a3, 
and  with  respect  to  ©,  we  have 

/rr  dV  d»e         1  f  <fy       de  \  /#      <fe  \"] 

"j  |_da,<Ja,      dajda,      ©Vdoj      dax)\dat    da,) J 

«e<?i>l...^.  =  0,    (239) 
e  e  tf>i . . .  dfc,  =  0.    (240) 
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The  multiple  integrals  in  the  last  four  equations  represent  the 
average  values  of  the  expressions  in  the  brackets,  which  we 
may  therefore  set  equal  to  zero.    The  first  gives 

as  already  obtained.  With  this  relation  and  (191)  we  get 
from  the  other  equations 

g^v-(£-3)-(§-S)  <*> 

We  may  add  for  comparison  equation  (205),  which  might  be 
derived  from  (286)  by  differentiating  twice  with  respect  to  6 : 

d&~     d®' 


(.-}-  — <&.:•£.  (244) 


The  two  last  equations  give 


dl- 


(A,  _  ^)  (e  -  «>  =  =p<«  -  1)*.  (246) 

at 


If  ^r  or  i  is  known  as  function  of  6,  o^  a^  etc.,  (*  —  e)*  may 
be  obtained  by  differentiation  as  function  of  the  same  variables. 
And  if  y(ty  or  Av  or  rj  is  known  as  function  of  ©,  ax,  etc., 
JAX  —  ^^  (6  —  €)  may  be  obtained  by  differentiation.  But 
(A1  —  JLxy  and  (-42  —  -lx)  (A2  —  <f a)  cannot  be  obtained  in  any 

similar  manner.  We  have  seen  that  (e— i)*  is  in  general  a 
vanishing  quantity  for  very  great  values  of  n,  which  we  may 
regard  as  contained  implicitly  in  6  as  a  divisor.    The  same  is 

true  of  (At  —  JL{)  (c  —  €).    It  does  not  appear  that  we  can 

assert  the  same  of  (At  —  -I1)a  or  (Ax  —  A{)  (-4a  —  -ia),  since 
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cPefdaf  may  be  very  great.  The  quantities  dhjdaf  and  dfyjdaf 
belong  to  the  class  called  elasticities.  The  former  expression 
represents  an  elasticity  measured  under  the  condition  that 
while  <*}  is  varied  the  internal  coordinates  jlf . .  •  qn  all  remain 
fixed.  The  latter  is  an  elasticity  measured  under  the  condi- 
tion that  when  a^  is  varied  the  ensemble  remains  canonically 
distributed  within  the  same  modulus.  This  corresponds  to 
an  elasticity  in  physics  measured  under  the  condition  of  con- 
stant temperature.  It  is  evident  that  the  former  is  greater 
than  the  latter,  and  it  may  be  enormously  greater. 

The  divergences  of  the  force  Al  from  its  average  value  are 
due  in  part  to  the  differences  of  energy  in  the  systems  of  the 
ensemble,  and  in  part  to  the  differences  in  the  value  ot 
the  forces  which  exist  in  systems  of  the  same  energy.  If  we 
write  JJ7L  for  the  average  value  of  Ax  in  systems  of  the 
ensemble  which  have  any  same  energy,  it  will  be  determined 
by  the  equation 

/•••/""  ^re  e  4*  •  •  •  *%* 

3Tlr=  ^L <W 


j . . .  j  e  e  dpt . . .  dqn 


where  the  limits  of  integration  in  both  multiple  integrals  are 
two  values  of  the  energy  which  differ  infinitely  little,  say  €  and 

e  +  de.  This  will  make  the  factor  e  e  constant  within  the 
limits  of  integration,  and  it  may  be  cancelled  in  the  numera- 
tor and  denominator,  leaving 

J  . . . J  dpi.  ..dqn 

where  the  integrals  as  before  are  to  be  taken  between  e  and 
€  +  de.  -Z7I*  k  therefore  independent  of  6,  being  a  function 
of  the  energy  and  the  external  coordinates. 
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Now  we  have  identically 

^  -  Jt  =  (^ -2T]€)  +  <2^€  -  2X), 

where  Ax  —  2J*  denotes  the  exoess  of  the  force  (tending  to 
increase  Oj)  exerted  by  any  system  above  the  average  of  such 
forces  for  systems  of  the  same  energy.    Accordingly, 


(j1-j;)*=(^1-2n€)»+2(^-3n€)(^.-^i)+(3nt-^i)a- 

But  the  average  value  of  (Ax  —  3[|€)  (J^]€  —  2{)  for  systems 
of  the  ensemble  which  have  the  same  energy  is  zero,  since  for 
such  systems  the  second  factor  is  constant.  Therefore  the 
average  for  the  whole  ensemble  is  zero,  and 

In  the  same  way  it  may  be  Shown  that 


It  is  evident  that  in  ensembles  in  which  the  anomalies  of 
energy  e  —  e  may  be  regarded  as  insensible  the  same  will  be 
true  of  the  quantities  represented  by  3^  —  Av 

The  properties  of  quantities  of  the  form  jQ  will  be 
farther  considered  in  Chapter  X,  which  will  be  devoted  to 
ensembles  of  constant  energy. 

It  may  not  be  without  interest  to  consider  some  general 
formulae  relating  to  averages  in  a  canonical  ensemble,  which 
embrace  many  of  the  results  which  have  been  given  in  this 
chapter. 

Let  u  be  any  function  of  the  internal  and  external  coordi- 
nates with  the  momenta  and  modulus.    We  have  by  definition 

. .  .Jus  e  (fa  . . .  dqn  (260) 


If  we  differentiate  with  respect  to  6,  we  have 
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or  d®      d®  ®*       *®d®  K      } 

Setting  u  =  1  in  this  equation,  we  get 

d®~     e 
and  substituting  this  value,  we  have 

du  _<&      U€      U€ 

de~50  +  i»  ""e* 


du      ^m3u      — 


or  &  —  -&  —  =  ue-u€  =  (u-u)(€-e).  (252) 

If  we  differentiate  equation  (250)  with  respect  to  a  (which 
may  represent  any  of  the  external  coordinates),  and  write  A 

for  the  force  —  57 »  we  g®* 


du      Uu       u  d\b      uA  yAMAv 

or  ™  =  ^  +  ZJr  +  ?±  (263) 

da      da      ®  da       ®  v/ 

Setting  ttslin  this  equation,  we  get 

Substituting  this  value,  we  have 

5a  =  ^  +  "0  ~  e'  (264) 

or  e^-0g  =  O«5J-==(^^^TTI):        (256) 

Repeated  applications  of  the  principles  expressed  by  equa- 
tions (252)  and  (255)  are  perhaps  best  made  in  the  particular 
cases.    Yet  we  may  write  (252)  in  this  form 
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(€  +  2>)(u-u,  =  0,  (266) 


where  D  represents  the  operator  0s  d/d&. 
Hence 


(€  +  Z>)*(*-u)  =  0,  (267) 

where  h  is  any  positive  whole  number.  It  will  be  observed, 
that  since  c  is  not  function  of  8,  (€  +  2))*  may  be  expanded  by 
the  binomial  theorem.    Or,  we  may  write 


(€  +  D)  u  =  (c  +  D)  u,  (268) 


whence  (c  +  D)h  u  =  (c  +  2>)*  5.  (269) 

But  the  operator  (e  +  D)*,  although  in  some  respects  more 
simple  than  the  operator  without  the  average  sign  on  the  *, 
cannot  be  expanded  oy  the  binomial  theorem,  since  e  is  a 
function  of  6  with  the  external  coordinates. 
So  from  equation  (254)  we  have 


(4  +  s)<*-5>-*  <260> 

whence                     (g  + :^)*(«-«)  =  0;  (261) 

-d            (F2>=(i+£)5'  <262> 

whence  (i  +  c)*— (1+s)5- 


(263) 


The  binomial  theorem  cannot  be  applied  to  these  operators. 

Again,  if  we  now  distinguish,  as  usual,  the  several  external 
coordinates  by  suffixes,  we  may  apply  successively  to  the 
expression  u  —  u  any  or  all  of  the  operators 

<+*3§    4+ea?   *  +  «ab  etc-    <26*> 
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as  many  times  as  we  choose,  and  in  any  order,  the  average 
value  of  the  result  will  be  zero.  Or,  if  we  apply  the  same 
operators  to  u,  and  finally  take  the  average  value,  it  will  be  the 
same  as  the  value  obtained  by  writing  the  sign  of  average 
separately  as  tt,  and  on  e,  Ax ,  A^ ,  etc.,  in  all  the  operators. 

If  u  is  independent  of  the  momenta,  formulae  similar  to 
the  preceding,  but  having  eq  in  place  of  e,  may  be  derived 
from  equation  (179). 
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CHAPTER  Vm. 

ON  CERTAIN  IMPORTANT  FUNCTIONS  OF  THE 
ENERGIES  OF  A  SY8TEM. 

In  older  to  consider  more  particularly  the  distribution  of  a 
canonical  ensemble  in  energy,  and  for  other  purposes,  it  will 
be  convenient  to  use  the  following  definitions  and  notations. 

Let  us  denote  by  P~the  extension-m-phase  below  a  certain 
limit  of  energy  which  we  shall  call  e.    That  is,  let 

the  integration  being  extended  (with  constant  values  of  the 
external  coordinates)  over  all  phases  for  which  the  energy  is 
less  than  the  limit  e.  We  shall  suppose  that  the  value  of  this 
integral  is  not  infinite,  except  for  an  infinite  value  of  the  lim- 
iting energy.  This  will  not  exclude  any  kind  of  system  to 
which  the  canonical  distribution  is  applicable.    For  if 


/•••/.-* 


dpl...dqn 


taken  without  limits  has  a  finite  value,*  the  less  value  repre- 
sented by 


J . . . j  dpt . .  .  dqm 


taken  below  a  limiting  value  of  e,  and  with  the  e  before  the 
integral  sign  representing  that  limiting  value,  will  also  be 
finite.  Therefore  the  value  of  V,  which  differs  only  by  a 
constant  factor,  will  also  be  finite,  for  finite  «.  It  is  a  func- 
tion of  e  and  the  external  coordinates,  a  continuous  increasing 

*  This  is  a  necessary  condition  of  the  canonical   distribution.     See 
Chapter  IV,  p.  86. 


Digitized  by  VjOOQIC 


88  CERTAIN  IMPORTANT  FUNCTIONS 

function  of  e,  which  becomes  infinite  with  e,  and  vanishes 
for  the  smallest  possible  value  of  e,  or  f or  e  =  —  oo,  if  the 
energy  may  be  diminished  without  limit. 
Let  us  also  set 

*  =  log~  (266) 

The  extension  in  phase  between  any  two  limits  of  energy,  4 
and  c",  will  be  represented  by  the  integral 

e*d*.  (267) 


x 


And  in  general,  we  may  substitute  e*  de  for  dp1 . . .  dqn  in  a 
2n-fold  integral,  reducing  it  to  a  simple  integral,  whenever 
the  limits  can  be  expressed  by  the  energy  alone,  and  the  other 
factor  under  the  integral  sign  is  a  function  of  the  energy  alone, 
or  with  quantities  which  are  constant  in  the  integration. 

In  particular  we  observe  that  the  probability  that  the  energy 
of  an  unspecified  system  of  a  canonical  ensemble  lies  between 
the  limits  e  and  e"  will  be  represented  by  the  integral  * 


f 


e+*. 


l>9  *>  <268) 

and  that  the  average  value  in  the  ensemble  of  any  quantity 
which  only  varies  with  the  energy  is  given  by  the  equation  f 

ue e         de,  (269) 


--f 


where  we  may  regard  the  constant  yfr  as  determined  by  the 
equation  $ 

e  e=/        e  e       de,  (270) 

r=o 
In  regard  to  the  lower  limit  in  these  integrals,  it  will  be  ob- 
served that  V  =.  0  is  equivalent  to  the  condition  that  the 
value  of  e  is  the  least  possible. 

*  Compare  equation  (93).  t  Compare  equation  (108). 

t  Compare  equation  (92). 
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In  like  manner,  let  us  denote  by  Vq  the  extension-in-configu- 
ration  below  a  certain  limit  of  potential  energy  which  we  may 
call  er    That  is,  let 

F'=/' '  'J****1  "  '  ***  (271) 

the  integration  being  extended  (with  constant  values  of  the 
external  coordinates)  over  all  configurations  for  which  the 
potential  energy  is  less  than  eq.  Vq  will  be  a  function  of  eq 
with  the  external  coordinates,  an  increasing  function  of  eq, 
which  does  not  become  infinite  (in  such  cases  as  we  shall  con- 
sider*) for  any  finite  value  of  er  It  vanishes  for  the  least 
possible  value  of  e#  or  for  eq  =  —  oo ,  if  eq  can  be  diminished 
without  limit.  It  is  not  always  a  continuous  function  of  eq. 
In  fact,  if  there  is  a  finite  extension-in-configuration  of  con- 
stant potential  energy,  the  corresponding  value  of  Vq  will 
not  include  that  extension-in-configuration,  but  if  eq  be  in- 
creased infinitesimally,  the  corresponding  value  of  Vq  will  be 
increased  by  that  finite  extension-in-configuration. 
Let  us  also  set 

*.  =  log^.  (272) 

The   extension-in-configuration  between  any  two  limits  of 
potential  energy  eq  and  eq"  may  be  represented  by  the  integral 


r 


e^d^  (273) 

whenever  there  is  no  discontinuity  in  the  value  of  Vq  as 
function  of  eq  between  or  at  those  limits,  that  is,  when- 
ever there  is  no  finite  extension-in-configuration  of  constant 
potential  energy  between  or  at  the  limits.  And  in  general, 
with  the  restriction  mentioned,  we  may  substitute  e*9  det  for 
Aj  dqx . . .  dqn  in  an  n-fold  integral,  reducing  it  to  a  simple 
integral,  when  the  limits  are  expressed  by  the  potential  energy, 
and  the  other  factor  under  the  integral  sign  is  a  function  of 

*  If  Vq  were  infinite  for  finite  values  of  *v  V  would  evidently  be  infinite 
for  finite  valnet  of  «. 
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the  potential  energy,  either  alone  or  with  quantities  which  are 
constant  in  the  integration. 

We  may  often  avoid  the  inconvenience  occasioned  by  for- 
mulae becoming  illusory  on  account  of  discontinuities  in  the 
values  of  Vq  as  function  of  eq  by  substituting  for  the  given 
discontinuous  function  a  continuous  function  which  is  practi- 
cally equivalent  to  the  given  funotion  for  the  purposes  of  the 
evaluations  desired.  It  only  requires  infinitesimal  changes  of 
potential  energy  to  destroy  the  finite  extensions-in-configura- 
tdon  of  constant  potential  energy  which  are  the  cause  of  the 
difficulty. 

In  the  case  of  an  ensemble  of  systems  canonically  distributed 
in  configuration,  when  Vq  is,  or  may  be  regarded  as,  a  continu- 
ous function  of  eq  (within  the  limits  considered),  the  proba- 
bility that  the  potential  energy  of  an  unspecified  system  lies 
between  the  limits  eq  and  eq"  is  given  by  the  integral 


r 


'*".fc?  +  V  (274) 


where  yfr  may  be  determined  by  the  condition  that  the  value  of 
the  integral  is  unity,  when  the  limits  include  all  possible 
values  of  er  In  the  same  case,  the  average  value  in  the  en- 
semble of  any  function  of  the  potential  energy  is  given  by  the 
equation 

€i=*>  fcri 


ue  det.  (275) 


8.r. 

F,=0 

When  Vq  is  not  a  continuous  function  of  e#  we  may  write  dVq 
for  e*9d€q  in  these  formulae. 

In  like  manner  also,  for  any  given  configuration,  let  us 
denote  by  Vp  the  extension-in-velocity  below  a  certain  limit  of 
kinetic  energy  specified  by  %.    That  is,  let 

V,  = J. . . JV  4h  •  •  •  4h .  (276) 


Digitized  by  VjOOQIC 


OF  THE  ENERGIES  OF  A  SYSTEM.  91 

the  integration  being  extended,  with  constant  values  of  the 
coordinates,  both  internal  and  external,  over  all  values  of  the 
momenta  for  which  the  kinetic  energy  is  less  than  the  limit  ^ 
Vp  will  evidently  be  a  continuous  increasing  function  of  ep 
which  vanishes  and  becomes  infinite  with  ^    Let  us  set 

4»  =  l°g^f.  (277) 

The  extension-in-velocity  between  any  two  limits  of  kinetic 
energy  <?/  and  <?/'  may  be  represented  by  the  integral 


/ 


e^dep.  (278) 

And  in  general,  we  may  substitute  e**  dep  for  AJ  dpx. . .  dpn 
or  AJ  dqx . . .  dqn  in  an  n-f old  integral  in  which  the  coordi- 
nates are  constant,  reducing  it  to  a  simple  integral,  when  the 
limits  are  expressed  by  the  kinetic  energy,  and  the  other  factor 
under  the  integral  sign  is  a  function  of  the  kinetic  energy, 
either  alone  or  with  quantities  which  are  constant  in  the 
integration. 

It  is  easy  to  express  Vp  and  $p  in  terms  of  er    Since  A,  is 
function  of  the  coordinates  alone,  we  have  by  definition 

F,  =  A* J.  .  .fdfr  ...dPn  (279) 

the  limits  of  the  integral  being  given  by  «p.    That  is,  if 

€P^F{pX9...pm)i  (280) 

the  limits  of  the  integral  for  e,  =  1,  are  given  by  the  equation 
r<JHf-Jti  =  lf  (281) 

and  the  limits  of  the  integral  for  ep  =  a1,  are  given  by  the 
equation 

F(Pi,.--Pn)  =  **-  (282) 

But  since  F  represents  a  quadratic  function,  this  equation 
maybe  written 
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The  value  of  Vp  may  also  be  put  in  the  form 

F,-*A,I/...JV&...*&.  (284) 

Now  we  may  determine  Vv  for  €9  =  1  from  (279)  where  the 
limits  are  expressed  by  (281),  and  Vp  for  ep  =  a2  from  (284) 
taking  the  limits  from  (288).  The  two  integrals  thus  deter- 
mined are  evidently  identical,  and  we  have 

» 

i.  6.,  Vp  varies  as  €,*.    We  may  therefore  set 

FP=CV,  e^CeJ-1  (286) 

where  C  is  a  constant,  at  least  for  fixed  values  of  the  internal 
coordinates. 

To  determine  this  constant,  let  us  consider  the  case  of  a 
canonical  distribution,  for  which  we  have 


£• 


e   e    +  *\fe,  =  l, 


fa  _2 

where  ee  =  (2ir0)  *. 


Substituting  this  value,  and  that  of  e**  from  (286),  we  get 

BO  _f?       2—1  n 


fCr(j)  =  (2»)5,  (287) 

g-     <2^     . 
r«n  +  1) 

Having  thus  determined  (he  value  of  the  constant  C,  we  may 
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substitute  it  in  the  general  expressions  (286),  and  obtain  the 
following  values,  which  are  perfectly  general: 

H 

fr  _    ff^fr)  (288) 


n    n    ~ 
T(in) 


►(289) 


It  will  be  observed  that  the  values  6f  V9  and  <f>,  for  any 
given  €p  are  independent  of  the  configuration,  and  even  of  the 
nature  of  the  system  considered,  except  with  respect  to  its 
number  of  degrees  of  freedom. 

Returning  to  the  canonical  ensemble,  we  may  express  the 
probability  that  the  kinetic  energy  of  a  system  of  a  given 
configuration,  but  otherwise  unspecified,  falls  within  given 
limits,  by  either  member  of  the  following  equation 

Since  this  value  is  independent  of  the  coordinates  it  also 
represents  the  probability  that  the  kinetic  energy  of  an 
unspecified  system  of  a  canonical  ensemble  falls  within  the 
limits.  The  form  of  the  last  integral  also  shows  that  the  prob- 
ability that  the  ratio  of  the  kinetic  energy  to  the  modulus 

*  Very  similar  rabies  for  Vq,  e+t,  V,  and  «+  may  be  found  in  the  same 
way  in  the  case  discussed  in  the  preceding  foot-notes  (see  pages  64, 72, 77,  and 
79),  in  which  tff  is  a  quadratic  function  of  the  <ftt  and  A*  independent  of  the  ^s. 
In  this  case  we  have 


r,fAA»(a»)8(«t-«.)8 
K«    \bj     r(i»  +  i)    • 

K  =  U,/     r(»  +  i)    • 


&*)'(. -*ri 


V         r(») 
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falls  -within  given  limits  is  independent  also  of  the  value  of 

the  modulus,  being  determined  entirely  by  the  number  of 

degrees  of  freedom  of  the  system  and  the  limiting  values 

of  the  ratio. 

The  average  value  of  any  function  of  the  kinetic  energy, 

either  for  the  whole  ensemble,  or  for  any  particular  configurer 

tion,  is  given  by 

t- 

«=^J f"ue~*cl-*de,  •(291) 

e'rftn) 

Thus: 

V:  =  r(r(i4W)<r>       "       m  +  *">05       t(292) 

*  The  corresponding  equation  for  the  average  value  of  any  function  of 
the  potential  energy,  when  this  is  a  quadratic  function  of  the  ?*s,  and  A$  is 
independent  of  the  q%  is 

In  the  same  case,  the  average  value  of  any  function  of  the  (total)  energy  is 
given  by  the  equation 


1    / 


u«     •    (•-«.)      rf*. 


Hence  in  thii  case 


f =i-     *     •>* 

«d  g.l.     if     „>i. 

If  n  =  1,  e*  =  2ir  and  tty/rfc  =  0  for  any  value  of  t.    If  n  =  2,  the  case  is 
the  same  with  respect  to  $<? 

t  This  equation  has  already  been  proved  for  positive  integral  powers  of 
the  kinetic  energy.    See  page  77. 
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^T^?^*"1'     «     •>»»      <»*) 
f*  =  i,        if       »>2;  (295) 

•Z=*'^  =  «.  (296) 

If  »  =  2,  «*»  =  2»,  and  dfo/dt,  =  0,  for  any  value  of  «^. 
The  definitions  of  P;  Vr  and  F^  give 

V=ffdV,drt  (297) 

where  the  integrations  cover  all  phases  for  which  the  energy 
is  less  than  the  limit  c,  for  which  the  value  of  Via  sought 
This  gives 

V=j  V,dV„  (298) 

Fts0 

and  ._    «tf« 

S=-£=fe*>dV„  (299) 

r«=o 

where  Vp  and  &*  are  connected  with  Vq  by  the  equation 

c,  +  ^  =  constant  =  e.  (300) 

If  n  >  2,  c^  vanishes  at  the  upper  limit,  t-  *♦,  f or  «,  =  0,  and 
we  get  by  another  differentiation 

We  may  also  write 

V=J    V,^^  (802) 

VqzzO 

e+ =  7  e+p++<dev  (808) 


t 
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etc*,  when  Vq  is  a  continuous  function  of  eq  commencing  with 
the  value  Vq  =  0,  or  when  we  choose  to  attribute  to  Vq  a 
fictitious  continuity  commencing  with  the  value  zero,  as  de- 
scribed on  page  90. 

If  we  substitute  in  these  equations  the  values  of  Vp  and  e*p 
which  we  have  found,  we  get 

*     €,-€  n 

F=ra(n+)2i)A<-e'>5dF«>  (304> 


where  e**  cUq  may  be  substituted  for  dVq  in  the  cases  above 
described.  If,  therefore,  n  is  known,  and  Vq  as  function  of 
cr  Fand  €*  may  be  found  by  quadratures. 

It  appears  from  these  equations  that  Via  always  a  continu- 
ous increasing  function  of  €,  commencing  with  the  value  V= 
0,  even  when  this  is  not  the  case  with  respect  to  Vq  and  er 
The  same  is  true  of  e*,  when  n  >  2,  or  when  n  =  2  if  ^in- 
creases continuously  with  eq  from  the  value  Vq  =  0. 

The  last  equation  may  be  derived  from  the  preceding  by 
differentiation  with  respect  to  e.  Successive  differentiations 
give,  if  h  <  J  n  +  1, 

n 


2  -*--  * 


rg=o    F  rf=o 

&V/df  is  therefore  positive  if  A  <  J  n  +  1.  It  is  an  in- 
creasing function  of  e,  if  h<  \n.  If  €  is  not  capable  of 
being  diminished  without  limit,  &V/d£  vanishes  for  the 
least  possible  value  of  e,  if  h  <  \n. 

If  n  is  even, 

» 

^F=(2x)i(Ft)  (307) 
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That  is,  Vq  is  the  same  function  of  e^  as ;  — r  of  c. 

(2*r)*  efei 

When  n  is  large,  approximate  formulae  will  be  more  avail- 
able. It  will  be  sufficient  to  indicate  the  method  proposed, 
without  precise  discussion  of  the  limits  of  its  applicability  or 
of  the  degree  of  its  approximation.  For  the  value  of  e*  cor- 
responding to  any  given  e,  we  have 

'e*'4*'^  =  f •***  •  d*,$  (308) 

vtzzo  y 

where  the  variables  are  connected  by  the  equation  (300). 
The  maximum  value  of  <£p  +  </>ff  is  therefore  characterized  by 
the  equation 

The  values  of  e,  and  eq  determined  by  this  maximum  we  shall 
distinguish  by  accents,  and  mark  the  corresponding  values  of 
functions  of  Cp  and  %  in  the  same  way.  Now  we  have  by 
Taylor's  theorem 

If  the  approximation  is  sufficient  without  going  beyond  the 
quadratic  terms,  since  by  (300) 

we  may  write 


e+ 


=  e+p'++*[el\*S'    W'J     2     *!i»     (312) 


where  the  limits  have  been  made  ±  oo  for  analytical  simplicity. 
This  is  allowable  when  the  quantity  in  the  square  brackets 
has  a  very  large  negative  value,  since  the  part  of  the  integral 
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corresponding  to  other  than  very  small  values  of  eq  —  ea'  may 
be  regarded  as  a  vanishing  quantity. 
This  gives 

(313) 


>  _  „v+v  r       ~2ir      i  * 


or 


♦  -V+*/  +  *to»-C»*)-*loB  [-($)- (gp)1].    (3W) 

From  this  equation,  with  (289),  (800)  and  (809),  we  may 
determine  the  value  of  <f>  corresponding  to  any  given  value  of 
6,  when  <f>q  is  known  as  function  of  er 

Any  two  systems  may  be  regarded  as  together  forming  a 
third  system.  If  we  have  For  <f>  given  as  function  of  e  for 
any  two  systems,  we  may  express  by  quadratures  Fand  <f>  for 
the  system  formed  by  combining  the  two.  If  we  distinguish 
by  the  suffixes  (  )x,  (  )a,  (  )w  the  quantities  relating  to 
the  three  systems,  we  have  easily  from  the  definitions  of  these 
quantities 

V*=C  CiV1dV%=Cvtdr1  =  frxdV%  =  fvxt%a*«  (315) 

e+»  =  fadVt  =  fe^d  Ff  = />+***  (316) 

where  the  double  integral  is  to  be  taken  within  the  limits 

Vi  =  0,  Vt  =  0,  and  ex  +  ef  =  €u, 
and  the  variables  in  the  single  integrals  are  connected  by  the 
last  of  these  equations,  while  the  limits  are  given  by  the  first 
two,  which  characterize  the  least  possible  values  of  et  and  e3 
respectively. 

It  will  be  observed  that  these  equations  are  identical  in 
form  with  those  by  which  Fand  <f>  are  derived  from  Vp  or  <f>p 
and  Vq  or  <f>v  except  that  they  do  not  admit  in  the  general 
case  those  transformations  which  result  from  substituting  for 
Vp  or  4>p  the  particular  functions  which  these  symbols  always 
represent. 
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Similar  formulae  may  be  used  to  derive  Vq  or  <f>q  for  the 
compound  system,  when  one  of  these  quantities  is  known 
as  function  of  the  potential  energy  in  each  of  the  systems 
combined. 

The  operation  represented  by  such  an  equation  as 

e      =   I  e    e    d€i 

is  identical  with  one  of  the  fundamental  operations  of  the 
theory  of  errors,  viz.,  that  of  finding  the  probability  of  an  error 
from  the  probabilities  of  partial  errors  of  which  it  is  made  up. 
It  admits  a  simple  geometrical  illustration. 

We  may  take  a  horizontal  line  as  an  axis  of  abscissas,  and  lay 
off  d  as  an  abscissa  measured  to  the  right  of  any  origin,  and 
erect  e*i  as  a  corresponding  ordinate,  thus  determining  a  certain 
curve.  Again,  taking  a  different  origin,  we  may  lay  off  e*  as 
abscissas  measured  to  the  left,  and  determine  a  second  curve  by 
erecting  the  ordinates  e+*.  We  may  suppose  the  distance  be- 
tween the  origins  to  be  e12,  the  second  origin  being  to  the  right 
if  €19  is  positive.  We  may  determine  a  third  curve  by  erecting 
at  every  point  in  the  line  (between  the  least  values  of  ^  and  e,) 
an  ordinate  which  represents  the  product  of  the  two  ordinates 
belonging  to  the  curves  already  described.  The  area  between 
this  third  curve  and  the  axis  of  abscissas  will  represent  the  value 
of  e*«.  To  get  the  value  of  this  quantity  for  varying  values 
of  613,  we  may  suppose  the  first  two  curves  to  be  rigidly  con- 
structed, and  to  be  capable  of  being  moved  independently.  We 
may  increase  or  diminish  t19  by  moving  one  of  these  curves  to 
the  right  or  left.  The  third  curve  must  be  constructed  anew 
for  each  different  value  of  e12. 
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CHAPTER  IX. 

THE  FUNCTION  <f>  AND  THE  CANONICAL  DISTRIBUTION. 

In  this  chapter  we  shall  return  to  the  consideration  of  the 
canonical  distribution,  in  order  to  investigate  those  properties 
which  are  especially  related  to  the  function  of  the  energy 
which  we  have  denoted  by  <f>. 

If  we  denote  by  -AT,  as  usual,  the  total  number  of  systems 
in  the  ensemble, 

will  represent  the  number  having  energies  between  the  limits 
€  and  e  +  de.    The  expression 

represents  what  may  be  called  the  density-vnrtnergy.  This 
vanishes  for  e  =  oo,  for  otherwise  the  necessary  equation 

C  e~*~+  de  =  l  (318) 

r=o 

could  not  be  fulfilled.  For  the  same  reason  the  density-in- 
energy  will  vanish  for  €  =  —  oo,  if  that  is  a  possible  value  of 
the  energy.  Generally,  however,  the  least  possible  value  of 
the  energy  will  be  a  finite  value,  for  which,  if  n  >  2,  et  will 
vanish,*  and  therefore  the  density-in-energy.  Now  the  density- 
in-energy  is  necessarily  positive,  and  since  it  vanishes  for 
extreme  values  of  the  energy  if  n  >  2,  it  must  have  a  maxi- 
mum in  such  cases,  in  which  the  energy  may  be  said  to  have 

*  See  page  90. 
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its   most  common    or   most  probable  value,  and  which  is 
determined  by  the  equation 


<U       B' 


(819) 


This  value  of  Afr/de  is  also,  when  *  >  2,  its  average  value 
in  the  ensemble.  For  we  have  identically,  by  integration  by 
parts, 

/*•    *-[••  \+U*    *■    (320) 

F=0  F=0       F=0 

If  n  >  2,  the  expression  in  the  brackets,  which  multiplied  by  N 
would  represent  the  density-in-energy,  vanishes  at  the  limits, 
and  we  have  by  (269)  and  (&18) 

It  appears,  therefore,  that  for  systems  of  more  than  two  degrees 
of  freedom,  the  average  value  of  cty/de  in  an  ensemble  canoni- 
cally  distributed  is  identical  with  the  value  of  the  same  differ- 
ential coefficient  as  calculated  for  the  most  common  energy 
in  the  ensemble,  both  values  being  reciprocals  of  the  modulus. 
Hitherto,  in  our  consideration  of  the  quantities  Vy  V#  Vp,  <f>, 
4>r  4>p>  we  ^ave  regarded  the  external  coordinates  as  constant. 
It  is  evident,  however,  from  their  definitions  that  Kand  <f>  are 
in  general  functions  of  the  external  coordinates  and  the  energy 
(e),  that  Vq  and  <f>q  are  in  general  functions  of  the  external 
coordinates  and  the  potential  energy  (e,).  Vp  and  <f>p  we  have 
found  to  be  functions  of  the  kinetic  energy  (ep)  alone.  In  the 
equation 

—31        n    —  _+^ 

e  e=l   e  •     de,  (322) 

by  which  ^r  may  be  determined,  8  and  the  external  coordinates 
(contained  implicitly  in  ^)  are  constant  in  the  integration. 
The  equation  shows  that  f  is  a  function  of  these  constants. 
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If  their  values  are  varied,  we  shall  have  fay  differentiation,  if 
n  >2, 

A  t=rco 


« • 


t=CO  (  #  =  oo  f 

+  daifyL6~***d€  +  da  tf^e^dc  +  eto.    (323) 

F=0  F=0 

(Since  «♦  vanishes  with  F,  when  n  >  %  there  are  no  terms  due 
to  the  variations  of  the  limits.)     Hence  by  (269) 

-|^  +  ^^  =  ^^+g(ia1  +  gdaf  +  etc.,    (324) 
or,  since  &    =  ^  (325) 

<ty  =  yd®  —  0 5~^  —  05^  **■  -  etc*        (326) 
Comparing  this  with  (112),  we  get 

*E-3f    ^-  =  5,   etc.  (327) 

The  first  of  these  equations  might  be  written* 

(SL=-(D.(IL       « 

but  must  not  be  confounded  with  the  equation 
which  is  derived  immediately  from  the  identity 

•  See  equations  (321)  and  (104).  Suffixes  are  here  added  to  the  differential 
coefficients,  to  make  the  meaning  perfectly  distinct,  although  the  same  quan- 
tities may  be  written  elsewhere  without  the  suffixes,  when  it  is  believed  that 
there  is  no  danger  of  misapprehension.  The  suffixes  indicate  the  quantities 
which  are  constant  in  the  differentiation,  the  single  letter  a  standing  for  aU 
the  letters  ai,  <4,  etc.,  or  all  except  the  one  which  is  explicitly  varied. 
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Moreover,  if  we  eliminate  <ty  from  (326)  by  the  equation 
df  =  %dy  +  yd%  +  de,  (331) 

obtained  by  differentiating  (325),  we  get 

<£  =  -  e<^-  e|£ cUh  -  ®  p-do,  -  etc,  (332) 

or  by  (821), 

Except  for  the  signs  of  average,  the  second  member  of  this 
equation  is  the  same  as  that  of  the  identity 

*♦=£* +£*«.+ £<*•+<*«•    <3s4> 

For  the  more  precise  comparison  of  these  equations,  we  may 
suppose  that  the  energy  in  the  last  equation  is  some  definite 
and  fairly  representative  energy  in  the  ensemble.  For  this 
purpose  we  might  choose  the  average  energy.  It  will  per- 
haps be  more  convenient  to  choose  the  most  common  energy, 
which  we  shall  denote  by  e0.  The  same  fcuffix  will  be  applied 
to  functions  of  the  energy  determined  for  this  value.  Our 
identity  then  becomes 

* = (S> + (£)  ** + (£)  ** + *•  « 

It  has  been  shown  that 


35 

de 


-CSX-i  <°*> 


when  n  >  2.  Moreover,  since  the  external  coordinates  have 
constant  values  throughout  the  ensemble,  the  values  of 
cUf>ldav  d<f>/dav  etc.  vary  in  the  ensemble  only  on  account 
of  the  variations  of  the  energy  (e),  which,  as  we  have  seen, 
may  be  regarded  as  sensibly  constant  throughout  the  en- 
semble, when  n  is  very  great.  In  this  case,  therefore,  we  may 
regard  the  average  values 

2'  £•  eto-> 
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as  practically  equivalent  to  the  values  relating  to  the  most 
common  energy 


(£);  (£):  ~ 


In  this  case  also  cte  is  practically  equivalent  to  de0.  We  have 
therefore,  for  very  large  values  of  w, 

-^  =  ^  (337) 

approximately.  That  is,  except  for  an  additive  constant,  —  rj 
may  be  regarded  as  practically  equivalent  to  <£0,  when  the 
number  of  degrees  of  freedom  of  the  system  is  very  great. 
It  is  not  meant  by  this  that  the  variable  part  of  i]  +  <f>0  is 
numerically  of  a  lower  order  of  magnitude  than  unity.  For 
when  n  is  very  great,  —  17  and  ^0  are  very  great,  and  we  can 
only  conclude  that  the  variable  part  of  rj  +  <f>0  is  insignifi- 
cant compared  with  the  variable  part  of  rj  or  of  <f>Qy  taken 
separately. 

Now  we  have  already  noticed  a  certain  correspondence 
between  the  quantities  8  and  1/  and  those  which  in  thermo- 
dynamics are  called  temperature  and  entropy.  The  property 
just  demonstrated,  with  those  expressed  by  equation  (336), 
therefore  suggests  that  the  quantities  <f>  and  de/d^  may  also 
correspond  to  the  thermodynamic  notions  of  entropy  and  tem- 
perature. We  leave  the  discussion  of  this  point  to  a  sub- 
sequent chapter,  and  only  mention  it  here  to  justify  the 
somewhat  detailed  investigation  of  the  relations  of  these 
quantities. 

We  may  get  a  clearer  view  of  the  limiting  form  of  the 
relations  when  the  number  of  degrees  of  freedom  is  indefi- 
nitely increased,  if  we  expand  the  function  <f>  in  a  series 
arranged  according  to  ascending  powers  of  €  —  eD.  This  ex- 
pansion may  be  written 

♦-*+(£).«.-* >♦(&*¥*♦  (ffl.^~ 

(338) 
Adding  the  identical  equation 
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we  get  by  (386) 


.■+*-tZ*+++W!l=P+&)*&+-- 


Substituting  this  value  in 


/ 


.  • +v 


which  expresses  the  probability  that  the  energy  of  an  unspeci- 
fied system  of  the  ensemble  lies  between  the  limits  c7  and  e", 
we  get 

r^ff***  )r*-+\**).-ir + etc*.     (340) 

When  the  number  of  degrees  of  freedom  is  very  great,  and 
€  —  e0  in  consequence  very  small,  we  may  neglect  the  higher 
powers  and  write* 

•■^H*J1iW.'T~*.  (341) 

t' 

This  shows  that   for  a  very  great  number  of  degrees  of 

freedom  the  probability  of  deviations  of  energy  from  the  most 

probable  value  (e0)  approaches  the  form  expressed  by  the 

'law  of  errors.'    With  this  approximate  law,  we  get 

*  If  a  higher  degree  of  accuracy  is  desired  than  is  afforded  by  this  formula, 
it  may  be  multiplied  by  the  series  obtained  from 


(£V-^+- 


by  the  ordinary  formula  for  the  expansion  in  series  of  an  exponential  func- 
tion. There  would  be  no  especial  analytical  difficulty  in  taking  account  of 
a  moderate  number  of  terms  of  such  a  series,  which  would  commence 


i+ 


(S).^©).^- 
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1  =  1,  (342) 


PIT 


2  =  6.,        (TT^—^y,  (343) 

whence 

(**}  

^  +  *  =  ttag^r5r  =  -*iog(3,r<«-«)»X     (844) 

Now  it  has  been  proved  in  Chapter  VII  that 

We  have  therefore 

^  +  ^  =  ^  +  ^=-*log(2.(7^)  =  -ilog(^V) 

(345) 
approximately.  The  order  of  magnitude  of  rj  —  <f>Q  is  there- 
fore that  of  log  ft.  This  magnitude  is  mainly  constant. 
The  order  of  magnitude  of  rj  +  ^0  —  J  log  »  is  that  of  unity. 
The  order  of  magnitude  of  ^0,  and  therefore  of  —  ij,  is  that 

of  ft.* 

Equation  (338)  gives  for  the  first  approximation 


<♦-*)<— *)-GH3-!|s.  (347) 

(y=^_S=£.«j|  (348) 

The  members  of  the  last  equation  have  the  order  of  magnitude 
of  ft.    Equation  (338)  gives  also  for  the  first  approximation 


•  Compare  (889),  (314). 
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whence 

(g-D<«-*>=(g).^-=-i.    <m 


This  is  of  the  order  of  magnitude  of  iu* 

It  should  be  observed  that  the  approximate  distribution  of 
the  ensemble  in  energy  according  to  the  4law  of  errors'  is 
not  dependent  on  the  particular  form  of  the  function  of  the 
energy  which  we  have  assumed  for  the  index  of  probability 
(17).    In  any  case,  we  must  have 


V^&rsl,  (361) 


«  =  <■ 

/' 

where  «*+♦  is  necessarily  positive.  This  requires  that  it 
shall  vanish  f or  e  =  oo ,  and  al^o  for  e  =  —  00 ,  if  this  is  a  possi- 
ble value.  It  has  been  shown  in  the  last  chapter  that  if  e  has 
a  (finite)  least  possible  value  (which  is  the  usual  case)  and 
n  >  2,  et  will  vanish  for  that  least  value  of  e.  In  general 
therefore  n  +  <t>  will  have  a  maximum,  which  determines  the 
most  probable  value  of  the  energy.  If  we  denote  this  value 
by  €0,  and  distinguish  the  corresponding  values  of  the  f  uno-  - 
tions  of  the  energy  by  the  same  suffix,  we  shall  have 

The  probability  that  an  unspecified  system  of  the  ensemble 

•  We  shall  find  hereafter  that  the  equation 

(£-*)<-=>— 

Is  exact  for  any  ralue  of  n  greater  than  2,  and  that  the  equation 
(d<t>      l\a_     d*+ 

\d*     e/  ~    <u* 

is  exact  for  any  ralue  of  n  greater  than  4. 
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falls  within  any  given  limits  of  energy  (c7  and  e")  is  repre- 
sented by 


/• 


If  we  expand  17  and  <f>  in  ascending  powers  of  c  —  e0,  without 
going  beyond  the  squares,  the  probability  that  the  energy  falls 
within  the  given  limits  takes  the  form  of  the  *  law  of  errors  * 

e^JelU<)o+Kd,d]   2   fc  (363) 

This  gives 

•+*-**[i£(3).-£(3)J   (3H) 

-        ^)-'=[-(S).-(S)J-'        <m 

We  shall  have  a  close  approximation  in  general  when  the 
quantities  equated  in  (355)  are  very  small,  i.  e.,  when 

-(3).-(S).  <w> 

is  very  great.  Now  when  71  is  very  great,  —  d*<f>/d<?  is  of  the 
same  order  of  magnitude,  and  the  condition  that  (356)  shall 
be  very  great  does  not  restrict  very  much  the  nature  of  the 
function  1/. 

We  may  obtain  other  properties  pertaining  to  average  values 
in  a  canonical  ensemble  by  the  method  used  for  the  average  of 
d<f>/d€.  Let  u  be  any  function  of  the  energy,  either  alone  or 
with  8  and  the  external  coordinates.  The  average  value  of  u 
in  the  ensemble  is  determined  by  the  equation 

«=®    *_€ 

/- — -  +  * 
»ee         cfe.  (357) 

F=0 
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Now  we  have  identically 

t=   OD  4— *  A_«  t=   CD 

/(£-!+«g)«T+'*=[«^+']  « 

F=0  F=0 

Therefore,  by  the  preceding  equation 


—  _  ^    €  f  : 

ff-5  +  w2-l,lt#      ] 


*(369) 


F=0 


If  we  set  u  =  1,  (a  value  which  need  not  be  excluded,)  the 
second  member  of  this  equation  vanishes,  as  shown  on  page 
101,  if  n  >  2,  and  we  get 


2$_  1 


(360) 


as  before.  It  is  evident  from  the  same  considerations  that  the 
second  member  of  (359)  will  always  vanish  if  n  >  2,  unless  u 
becomes  infinite  at  one  of  the  limits,  in  which  case  a  more  care- 
ful examination  of  the  value  of  the  expression  will  be  necessary. 
To  facilitate  the  discussion  of  such  cases,  it  will  be  convenient 
to  introduce  a  certain  limitation  in  regard  to  the  nature  of  the 
system  considered.  We  have  necessarily  supposed,  in  all  our 
treatment  of  systems  canonically  distributed,  that  the  system 
considered  was  such  as  to  be  capable  of  the  canonical  distri- 
bution with  the  given  value  of  the  modulus.  We  shall  now 
suppose  that  the  system  is  such  as  to  be  capable  of  a  canonical 
distribution  with  any  (finite)  f  modulus.  Let  us  see  what 
cases  we  exclude  by  this  last  limitation. 


*  A  more  general  equation,  which  is  not  limited  to  ensembles  canonically 
distributed,  is 

where  ij  denotes,  as  usual,  the  index  of  probability  of  phase. 

t  The  term  finite  applied  to  the  modulus  is  intended  to  exclude  the  value 
zero  as  well  as  infinity. 
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The  impossibility  of  a  canonical  distribution  occurs  when 
the  equation 

4      €  =  00      * 

6  e=J  6  e     de  (361) 

fails  to  determine  a  finite  value  for  yfr.  Evidently  the  equation 
cannot  make  yfr  an  infinite  positive  quantity,  the  impossibility 
therefore  occurs  when  the  equation  makes  ^r  =  —  oo .  Now 
we  get  easily  from  (191) 

If  the  canonical  distribution  is  possible  for  any  values  of  6, 
we  can  apply  this  equation  so  long  as  the  canonical  distribu- 
tion is  possible.  The  equation  shows  that  as  6  is  increased 
(without  becoming  infinite)  —  ^r  cannot  become  infinite  unless 
e  simultaneously  becomes  infinite,  and  that  as  8  is  decreased 
(without  becoming  zero)  —  ^r  cannot  become  infinite  unless 
simultaneously  e  becomes  an  infinite  negative  quantity.  The 
corresponding  cases  in  thermodynamics  would  be  bodies  which 
could  absorb  or  give  out  an  infinite  amount  of  heat  without 
passing  certain  limits  of  temperature,  when  no  external  work 
is  done  in  the  positive  or  negative  sense.  Such  infinite  values 
present  no  analytical  difficulties,  and  do  not  contradict  the 
general  laws  of  mechanics  or  of  thermodynamics,  but  they 
are  quite  foreign  to  our  ordinary  experience  of  nature.  In 
excluding  such  cases  (which  are  certainly  not  entirely  devoid 
of  interest)  we  do  not  exclude  any  which  are  analogous  to 
any  actual  cases  in  thermodynamics. 

We  assume  then  that  for  any  finite  value  of  8  the  second 
member  of  (361)  has  a  finite  value. 

When  this  condition  is  fulfilled,  the  second  member  of 
(359)  will  vanish  for  u  =  e~+  V.    For,  if  we  set  8>  =  28, 

r=o  r  =  o 
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where  yf/  denotes  the  value  of  yjr  for  the  modulus  8'.  Since 
the  last  member  of  this  formula  vanishes  for  c  =  oo ,  the 
less  value  represented  by  the  first  member  must  also  vanish 
for  the  same  value  of  e.  Therefore  the  second  member  of 
(859),  which  differs  only  by  a  constant  factor,  vanishes  at 
the  upper  limit.  The  case  of  the  lower  limit  remains  to  be 
considered.    Now 


eF<   r-*» 


The  second  member  of  this  formula  evidently  vanishes  for 
the  value  of  c,  which  gives  V=0>  whether  this  be  finite  or 
negative  infinity.  Therefore,  the  second  member  of  (359) 
vanishes  at  the  lower  limit  also,  and  we  have 


de  0  de 


or  e      F=0.  (362) 

This  equation,  which  is  subject  to  no  restriction  in  regard  to 
the  value  of  n,  suggests  a  connection  or  analogy  between  the 
function  of  the  energy  of  a  system  which  is  represented  by 
€~*  V  and  the  notion  of  temperature  in  thermodynamics.  We 
shall  return  to  this  subject  in  Chapter  XIV. 

If  n  >  2,  the  second  member  of  (859)  may  easily  be  shown 
to  vanish  for  any  of  the  following  values  of  u  viz. :  £,  «*,  e, 
€*,  where  m  denotes  any  positive  number.  It  will  also 
vanish,  when  n  >  4,  for  u  =  d<f>/dey  and  when  n  >  2  A  for 
u  —  e~*<PV/d^.  When  the  second  member  of  (359)  van- 
ishes, and  n  >  2,  we  may  write 


( 


-.  /  d<f>      1  \  dd>      u  du  ,i*„«x 

—  ^-•)=-3?-5~~S-         (363) 


We  thus  obtain  the  following  equations : 
Ifn>2, 


<*-*>@-D=^?-i=-^     (364) 
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"    (Fiy=(S)'-*=-(5y'    *-» 

Ifn  >2A, 

«     d?*-  =  e6     *F  <370> 


—A  rf*+1  F  1 

whence  a~*  -^^  =  ^.  (371) 

Giving  h  the  values  1,  2,  3,  etc.,  we  have 


3-J.  «»>* 


as  already  obtained.    Also 

rf?  +  Sd?  &  +  V* )  ~ g*     lf  n  > 6>  (372) 

*  This  equation  may  also  be  obtained  from  equations  (252)  and  (821). 
Compare  also  equation  (349)  which  was  derived  by  an  approximative  method, 
t  Compare  equation  (360),  obtained  by  an  approximative  method. 
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If  Vq  is  a  continuous  increasing  function  of  e#  commencing 
with  Vt  =  0,  the  average  value  in  a  canonical  ensemble  of  any 
function  of  e#  either  alone  or  with  the  modulus  and  the  exter- 
nal coordinates,  is  given  by  equation  (275),  which  is  identical 
with  (357)  except  that  e,  </>,  and  ^r  have  the  suffix  (  )ff.  The 
equation  may  be  transformed  so  as  to  give  an  equation  iden- 
tical with  (859)  except  for  the  suffixes.  If  we  add  the  same 
suffixes  to  equation  (861),  the  finite  value  of  its  members  will 
determine  the  possibility  of  the  canonical  distribution. 

From  these  data,  it  is  easy  to  derive  equations  similar  to 
(860),  (862)-(872),  except  that  the  conditions  of  their  valid- 
ity  must  be  differently  stated.    The  equation 


requires  only  the  condition  already  mentioned  with  respect  to 
Vq.  This  equation  corresponds  to  (862),  which  is  subject  to 
no  restriction  with  respect  to  the  value  of  n.  We  may  ob- 
serve, however,  that  Fwill  always  satisfy  a  condition  similar 
to  that  mentioned  with  respect  to  Vr 

If  Vq  satisfies  the  condition  mentioned,  and  e**  a  similar 
condition,  t.  e.,  if  e**  is  a  continuous  increasing  function  of  er 
commencing  with  the  value  €*q  =  0,  equations  will  hold  sim- 
ilar to  those  given  for  the  case  when  n  >  2,  viz.,  similar  to 
(860),  (864)-(368).    Especially  important  is 

(kg  ~~  ©' 

H  Vqf  t+*  (or  dVJde^i  cPVJdeg2  all  satisfy  similar  conditions, 
we  shall  have  an  equation  similar  to  (869),  which  was  subject 
to  the  condition  n  >  4.  And  if  cP  Vqldeqz  also  satisfies  a 
similar  condition,  we  shall  have  an  equation  similar  to  (372), 
for  which  the  condition  was  n  >  6.  Finally,  if  Vq  and  A  suc- 
cessive differential  coefficients  satisfy  conditions  of  the  kind 
mentioned,  we  shall  have  equations  like  (370)  and  (371)  for 
which  the  condition  was  n  >  2  h. 
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These  conditions  take  the  place  of  those  given  above  relat- 
ing to  n.  In  fact,  we  might  give  conditions  relating  to  the 
differential  coefficients  of  V,  similar  to  those  given  relating  to 
the  differential  coefficients  of  Vq,  instead  of  the  conditions 
relating  to  n,  for  the  validity  of  equations  (860),  (868)-(872). 
This  would  somewhat  extend  the  application  of  the  equations. 
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CHAPTER  X. 

ON  A   DISTRIBUTION  IN   PHASE    CALLED  MICBOCANONI- 
CAL  IN  WHICH  ALL  THE  SYSTEMS  HAVE 
THE  SAME  ENERGY. 

An  important  case  of  statistical  equilibrium  is  that  in  which 
all  systems  of  the  ensemble  have  the  same  energy.  We  may 
arrive  at  the  notion  of  a  distribution  which  will  satisfy  the 
necessary  conditions  by  the  following  process.  We  may 
suppose  that  an  ensemble  is  distributed  with  a  uniform  den- 
sity-in-phase  between  two  limiting  values  of  the  energy,  4  and 
e",  and  with  density  zero  outside  of  those  limits.  Such  an 
ensemble  is  evidently  in  statistical  equilibrium  according  to 
the  criterion  in  Chapter  IV,  since  the  density-in-phase  may  be 
regarded  as  a  function  of  the  energy.  By  diminishing  the 
difference  of  e'  and  e\  we  may  diminish  the  differences  of 
energy  in  the  ensemble.  The  limit  of  this  process  gives  us 
a  permanent  distribution  in  which  the  energy  is  constant. 

We  should  arrive  at  the  same  result,  if  we  should  make  the 
density  any  function  of  the  energy  between  the  limits  e'  and 
c",  and  zero  outside  of  those  limits.  Thus,  the  limiting  distri- 
bution obtained  from  the  part  of  a  canonical  ensemble 
between  two  limits  of  energy,  when  the  difference  of  the 
limiting  energies  is  indefinitely  diminished,  is  independent  of 
the  modulus,  being  determined  entirely  by  the  energy,  and 
is  identical'  with  the  limiting  distribution  obtained  from  a 
uniform  density  between  limits  of  energy  approaching  the 
same  value. 

We  shall  call  the  limiting  distribution  at  which  we  arrive 
by  this  process  microcanonical. 

We  shall  find  however,  in  certain  cases,  that  for  certain 
values  of  the  energy,  viz.,  for  those  for  which  e*  is  infinite, 
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this  process  fails  to  define  a  limiting  distribution  in  any  such 
distinct  sense  as  for  other  values  of  the  energy.  The  difficulty 
is  not  in  the  process,  but  in  the  nature  of  the  case,  being 
entirely  analogous  to  that  which  we  meet  when  we  try  to  find 
a  canonical  distribution  in  cases  when  yfr  becomes  infinite. 
We  have  not  regarded  such  cases  as  affording  true  examples 
of  the  canonical  distribution,  and  we  shall  not  regard  the  cases 
in  which  e*  is  infinite  as  affording  true  examples  of  the  micro- 
canonical  distribution.  We  shall  in  fact  find  as  we  go  on  that 
in  such  cases  our  most  important  formulae  become  illusory. 

The  use  of  formulae  relating  to  a  canonical  ensemble  which 
contain  e*de  instead  of  dpt . . .  dqH,  as  in  the  preceding  chapters, 
amounts  to  the  consideration  of  the  ensemble  as  divided  into 
an  infinity  of  microcanonical  elements. 

From  a  certain  point  of  view,  the  microcanonical  distribution 
may  seem  more  simple  than  the  canonical,  and  it  has  perhaps 
been  more  studied,  and  been  regarded  as  more  closely  related 
to  the  fundamental  notions  of  thermodynamics.  To  this  last 
point  we  shall  return  in  a  subsequent  chapter.  It  is  sufficient 
here  to  remark  that  analytically  the  canonical  distribution  is 
much  more  manageable  than  the  microcanonical. 

We  may  sometimes  avoid  difficulties  which  the  microcanon- 
ical distribution  presents  by  regarding  it  as  the  result  of  the 
following  process,  which  involves  conceptions  less  simple  but 
more  amenable  to  analytical  treatment.  We  may  suppose  an 
ensemble  distributed  with  a  density  proportional  to 

e      *    » 

where  a>  and  e7  are  constants,  and  then  diminish  indefinitely 
the  value  of  the  constant  a>.  Here  the  density  is  nowhere 
2#ro  until  we  come  to  the  limit,  but  at  the  limit  it  is  zero  for 
all  energies  except  e'.  We  thus  avoid  the  analytical  compli- 
cation of  discontinuities  in  the  value  of  the  density,  which 
require  the  use  of  integrals  with  inconvenient  limits. 

In  a  microcanonical  ensemble  of  systems  the  energy  (e)  is 
constant,  but  the  kinetic  energy  (e,)  and  the  potential  energy 
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(eq)  vary  in  the  different  systems,  subject  of  course  to  the  con- 
dition 

€p  +  eq  =  e  =  constant.  (373) 

Our  first  inquiries  will  relate  to  the  division  of  energy  into 
these  two  parts,  and  to  the  average  values  of  functions  of  ep 
and  er 

We  shall  use  the  notation  w|f  to  denote  an  average  value  in 
a  microcanonical  ensemble  of  energy  6.  An  average  value 
in  a  canonical  ensemble  of  modulus  8,  which  has  hitherto 
been  denoted  by  uy  we  shall  in  this  chapter  denote  by  3|e,  to 
distinguish  more  clearly  the  two  kinds  of  averages. 

The  extension-in-phase  within  any  limits  which  can  be  given 
in  terms  of  ep  and  eq  may  be  expressed  in  the  notations  of  the 
preceding  chapter  by  the  double  integral 


JfdVpdVt 


taken  within  those  limits.  If  an  ensemble  of  systems  is  dis- 
tributed within  those  limits  with  a  uniform  density-in-phase, 
the  average  value  in  the  ensemble  of  any  function  (u)  of  the 
kinetic  and  potential  energies  will  be  expressed  by  the  quotient 
of  integrals 

P  '  udVpdVq 


//« 


//■ 


dVpdVq 


Since  d  Vp  =  e**  de^  and  d^  =  <2e  when  eq  is  constant,  the 
expression  may  be  written 


//< 


ue*pdcdVt 


17' 


e+pdedV0 


Q 


To  get  the  average  value  of  u  in  an  ensemble  distributed 
microcanonically  with  the  energy  6,  we  must  make  the  in- 
tegrations cover  the  extension-in-phase  between  the  energies 
€  and  e  +  de.    This  gives 
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^  = 


de  J  u 

Fg=0 


?pdV9 


««=• 


*J* 


6+PdV9 


F«=0 

But  by  (299)  the  value  of  the  integral  in  the  denominator 
is  e*.    We  have  therefore 

tf*  =  *~*  fue^dV,,  (374) 

p*=o 

where  e*?  and  Vq  are  connected  by  equation  (878),  and  u,  if 
given  as  function  of  e^  or  of  ep  and  e9,  becomes  in  virtue  of 
the  same  equation  a  function  of  eq  alone. 

We  shall  assume  that  e*  has  a  finite  value.  If  n  >  1,  it  is 
evident  from  equation  (805)  that  e*  is  an  increasing  function 
of  6,  and  therefore  cannot  be  infinite  for  one  value  of  e  without 
being  infinite  for  all  greater  values  of  e,  which  would  make 
—  ^r  infinite.*  When  »  >  1,  therefore,  if  we  assume  that  e* 
is  finite,  we  only  exclude  such  cases  as  we  found  necessary 
to  exclude  in  the  study  of  the  canonical  distribution.  But 
when  n  =  1,  cases  may  occur  in  which  the  canonical  distribu- 
tion is  perfectly  applicable,  but  in  which  the  formulae  for  the 
microcanonical  distribution  become  illusory,  for  particular  val- 
ues of  e,  on  account  of  the  infinite  value  of  t+.  Such  failing 
cases  of  the  microcanonical  distribution  for  particular  values 
of  the  energy  will  not  prevent  us  from  regarding  the  canon- 
ical ensemble  as  consisting  of  an  infinity  of  microcanonical 
ensembles,  f 


*  See  equation  i 

t  An  example  of  the  failing  case  of  the  microcanonical  distribution  is 
afforded  by  a  material  point,  under  the  influence  of  gravity,  and  constrained 
to  remain  in  a  vertical  circle.  The  failing  case  occurs  when  the  energy  is 
just  sufficient  to  carry  the  material  point  to  the  highest  point  of  the  circle. 

It  will  be  observed  that  the  difficulty  is  inherent  in  the  nature  of  the  case, 
and  is  quite  independent  of  the  mathematical  formulae.  The  nature  of  the 
difficulty  is  at  once  apparent  if  we  try  to  distribute  a  finite  number  of 
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From  the  last  equation,  with  (298),  we  get 

e~*prp\.  =  6~*JrPdrq  =  i*  v.  (375) 

r«=o 
But  by  equations  (288)  and  (289) 

e~VP  =  ?€,.  (376) 

Therefore  

e~V*=e~*pFj.  =  ?£!..  (377) 

Again,  with  the  aid  of  equation  (801),  we  get 
if  n  >  2.    Therefore,  by  (289), 

ff=fl.=G-1)^1" a  ">2-     (J79) 

These  results  are  interesting  on  account  of  the  relations  of 

the  functions  e~+  V  and  -£  to  the  notion  of  temperature  in 

thermodynamics,  —  a  subject  to  which  we  shall  return  here- 
after. They  are  particular  cases  of  a  general  relation  easily 
deduced  from  equations  (806),  (874),  (288)  and  (289).  We 
have 

*?=J    *fiV"    lf   *<*»  +  1- 

The  equation  may  be  written 

-4>d*V       -+*f'-+pd*Vp  <hJV 
v9=o 

material  points  with  this  particular  value  of  the  energy  as  nearly  as  possible 
in  statistical  equilibrium,  or  if  we  ask :  What  is  the  probability  that  a  point 
taken  at  random  from  an  ensemble  in  statistical  equilibrium  with  this  ralue 
of  the  energy  will  be  found  in  any  specified  part  of  the  circle  1 
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We  have  therefore 


-4>dkV      -4>Pd*Vp 


=  6 


d?~"        deSk-ran-h  +  l)**"^'     (380) 

if  A<  Jn  +  1,    For  example,  when  w  is  even,  we  may  make 
A  =  J  n,  which  gives,  with  (307), 

(2^(r,)f9=.  =  ra«)€71-  (881) 

Since  any  canonical  ensemble  of  systems  may  be  regarded 
as  composed  of  microcanonical  ensembles,  if  any  quantities 
u  and  v  have  the  same  average  values  in  every  microcanonical 
ensemble,  they  will  have  the  same  values  in  every  canonical 
ensemble.  To  bring  equation  (380)  formally  under  this  rule, 
we  may  observe  that  the  first  member  being  a  function  of  e  is 
a  constant  value  in  a  microcanonical  ensemble,  and  therefore 
identical  with  its  average  value.  We  get  thus  the  general 
equation 

6       cfc*|e"~w     "   &P*Je      r(*n-A  +  l) 
if  h  <  J  w  +  1.*    The  equations 

2 
n 


SL-rTI^fen^-^   <» 


&  =  ^Vl  =  e-*PVX  =  l^,  (383) 


B~de\9~dePU~\2      L)**    ,e' 


(384) 


may  be  regarded  as  particular  cases  of  the  general  equation. 
The  last  equation  is  subject  to  the  condition  that  n  >  2. 

The  last  two  equations  give  for  a  canonical  ensemble, 
if  »  >  2, 

(l-|)*I.V*].  =  l-  (385) 

The  corresponding  equations  for  a  microcanonical  ensemble 
give,  if  n  >  2, 

*  See  equation 
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which  shows  that  d<f>  dlog  V  approaches  the  value  unity 
when  n  is  very  great. 

If  a  system  consists  of  two  parts,  haying  separate  energies, 
we  may  obtain  equations  similar  in  form  to  the  preceding, 
which  relate  to  the  system  as  thus  divided.*  We  shall 
distinguish  quantities  relating  to  the  parts  by  letters  with 
suffixes,  the  same  letters  without  suffixes  relating  to  the 
whole  system.  The  extension-in-phase  of  the  whole  system 
within  any  given  limits  of  the  energies  may  be  represented  by 
the  double  integral 


// 


dVxdV* 


taken  within  those  limits,  as  appears  at  once  from  the  defini- 
tions of  Chapter  VIII.  In  an  ensemble  distributed  with 
uniform  density  within  those  limits,  and  zero  density  outside, 
the  average  value  of  any  function  of  ex  and  e»  is  given  by  the 
quotient 

udVtdVt 


//■ 


77 


dVxdV% 
which  may  also  be  written  f 

f Jue^ckdVi 

f  fe*xdedV% 
If  we  make  the  limits  of  integration  e  and  e  +  de,  we  get  the 

*  If  this  condition  is  rigorously  fulfilled,  the  parts  will  have  no  influence 
on  each  other,  and  the  ensemble  formed  by  distributing  the  whole  micro- 
canonically  is  too  arbitrary  a  conception  to  have  a  real  interest.  The  prin- 
cipal interest  of  the  equations  which  we  shall  obtain  will  be  in  cases  in 
which  the  condition  is  approximately  fulfilled.  But  for  the  purposes  of  a 
theoretical  discussion,  it  is  of  course  convenient  to  make  such  a  condition 
absolute.  Compare  Chapter  IV,  pp.  35  ff.,  where  a  similar  condition  is  con- 
sidered in  connection  with  canonical  ensembles. 

t  Where  the  analytical  transformations  are  identical  in  form  with  those 
on  the  preceding  pages,  it  does  not  appear  necessary  to  give  all  the  steps 
with  the  same  detail. 
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average  value  of  u  in  an  ensemble  in  which  the  whole  system 
is  microcanonically  distributed  in  phase,  viz., 

**? 

^^e+jue^dV*,  (387) 

n=o 
where  ^  and  V%  are  connected  bjrthe  equation 

€X  +  e,  =  constant  =  €,  (388) 

and  u,  if  given  as  function  of  eu  or  of  €i  and  €,,  becomes  in 

virtue  of  the  same  equation  a  function  of  e%  alone.* 

Thus 

7^r^^r¥jvldVu  (389) 

e^r=7*^  =  7^r&.  (390) 

This  requires  a  similar  relation  for  canonical  averages 

0  =  <TV|e  =  <T*Pi|e  =  ^r\9.  (391) 

Again 

^.=f*7tsdr>-       (s92) 

But  if  74  >  2,  e+*  vanishes  for  Vx  =  0,f  and 

F.T30  F.T30 

Hence,  if  7^  >  2,  and  n%  >  2, 


cfe       <fei  |«      <ka  k 


(394) 


*  In  the  applications  of  the  equation  (387),  we  cannot  obtain  all  the  result* 
corresponding  to  those  which  we  have  obtained  from  equation  (874),  because 
<pp  is  a  known  function  of  tP,  while  <t>\  must  be  treated  as  an  arbitrary  func- 
tion of  «i,  or  nearly  so. 

t  See  Chapter  VIII,  equations  (806)  and  (816). 
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We  have  compared  certain  functions  of  the  energy  of  the 
whole  system  with  average  values  of  similar  functions  of 
the  kinetic  energy  of  the  whole  system,  and  with  average 
values  of  similar  functions  of  the  whole  energy  of  a  part  of 
the  system.  We  may  also  compare  the  same  functions  with 
average  values  of  the  kinetic  energy  of  a  part  of  the  system. 

We  shall  express  the  total,  kinetic,  and  potential  energies  of 
the  whole  system  by  €,  Cp,  and  €q,  and  the  kinetic  energies  of  the 
parts  by  e^  and  e^  These  kinetic  energies  are  necessarily  sep- 
arate :  we  need  not  make  any  supposition  concerning  potential 
energies.  The  extension-in-phase  within  any  limits  which  can 
be  expressed  in  terms  of  er  e^  e^  may  be  represented  in  the 
notations  of  Chapter  \U1  by  the  triple  integral 


fffdV»dV*dV< 


taken  within  those  limits.  And  if  an  ensemble  of  systems  is 
distributed  with  a  uniform  density  within  those  limits,  the 
average  value  of  any  function  of  e0  e^,  e^  will  be  expressed 
by  the  quotient 

fffudVlpdr,pdV9 

f%dVlpdVtpdVq 


or 


f  f  fu6+vdedV%pdVq 
fffe+vdedr^dV, 


To  get  the  average  value  of  u  for  a  microcanonical  distribu- 
tion, we  must  make  the  limits  e  and  e  +  de.  The  denominator 
in  this  case  becomes  e*  de,  and  we  have 


*,=€      C2p=«— €g 


^  =  «J        fue^dV^dr,,  (396) 

r«=o    €,,,=0 
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where  $&>  V^,,  and  Vq  are  connected  by  the  equation 
«iP  +  €ip  +  eq  =  constant  =  e 
Accordingly 

f g=f       f S|^=f — «ff 

e-*»Vlp\,  =  *~*f     Jv^iV+iV^f+V,      (397) 
and  we  may  write 

rV-.^rJ.-.^rJ^i^l-J.a,    (398) 

and 


th 


*i 


0  =  7V|e  =  e-**Vlp[>  =  <T**%|e  =  »^e  =  £«*•  (399) 


Again,  if  nx  >  2, 


<folp 


-"7  /  j&'*''v. 


JV=0     tap=0 


Hence,  if  n^  >  2,  and  n^  >  2, 


d <f> d<f>ip 


de       de] 


ip  !• 


_^TP 


d€.t 


=  (*%- 1)^  =  ^-1)^,    (401) 


0""  de  e 


=  |FL =S =(i«i-l)V,Tt=(l%-l)v\.  (402) 


We  cannot  apply  the  methods  employed  in  the  preceding 
pages  to  the  microcanonical  averages  of  the  (generalized) 
forces  A v  Ay  etc.,  exerted  by  a  system  on  external  bodies, 
since  these  quantities  are  not  functions  of  the  energies,  either 
kinetic  or  potential,  of  the  whole  or  any  part  of  the  system. 
We  may  however  use  the  method  described  on  page  116. 
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Let  us  imagine  an  ensemble  of  systems  distributed  in  phase 
according  to  the  index  of  probability 

where  c7  is  any  constant  which  is  a  possible  value  of  the 
energy,  except  only  the  least  value  which  is  consistent  with 
the  values  of  the  external  coordinates,  and  c  and  a>  are  other 
constants.    We  have  therefore 


/■•■/ 

phiufit 


—  C 

or  e 


dPi--.dqn  =  l,  (403) 

=J-.Je      *    dPl...dq„  (404) 


oragaui  e     =  f    e      *         de.  (405) 


6     =  I      e      *         de. 


From  (404)  we  have 


=J      2€—J--l&e      *         A,  (406) 

where- ^]f  denotes  the  average  value  of  -4X  in  those  systems 
of  the  ensemble  which  have  any  same  energy  e.  (This 
is  the  same  thing  as  the  average  value  of  At  in  a  microcanoni- 
cal  ensemble  of  energy  €.)  The  validity  of  the  transformation 
is  evident,  if  we  consider  separately  the  part  of  each  integral 
which  lies  between  two  infinitesimally  differing  limits  of 
energy.    Integrating  by  parts,  we  get 
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FS=aO  N  ' 

Differentiating  (405),  we  get 

f-o  'r=a> 

where  €a  denotes  the  least  value  of  e  consistent  with  the  exter- 
nal coordinates.  The  last  term  in  this  equation  represents  the 
part  of  de~c/dal  which  is  due  to  the  variation  of  the  lower 
limit  of  the  integral.  It  is  evident  that  the  expression  in  the 
brackets  will  vanish  at  the  upper  limit.  At  the  lower  limit, 
at  which  ep  =  0,  and  eq  has  the  least  value  consistent  with  the 
external  coordinates,  the  average  sign  on  3^],  is  superfluous, 
as  there  is  but  one  value  of  Ax  which  is  represented  by 
—  dea  /day  Exceptions  may  indeed  occur  for  particular  values 
of  the  external  coordinates,  at  which  d€a/dat  receive  a  finite 
increment,  and  the  formula  becomes  illusory.  Such  particular 
values  we  may  for  the  moment  leave  out  of  account.  The 
last  term  of  (408)  is  therefore  equal  to  the  first  term  of  the 
second  member  of  (407).  (We  may  observe  that  both  vanish 
when  n  >  2  on  account  of  the  factor  e*.y 
We  have  therefore  from  these  equations 

f=Q0  (c-fOa  f=a0        (c-tO* 


or 


/(f^-SK"^*-"-    <m 


That  is :  the  average  value  in  the  ensemble  of  the  quantity 
represented  by  the  principal  parenthesis  is  zero.    This  must 
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be  true  for  any  value  of  ».  If  we  diminish  <»,  the  average 
value  of  the  parenthesis  at  the  limit  when  a>  vanishes  beoomes 
identical  with  the  value  for  e  saV.  But  this  may  be  any  value 
of  the  energy,  except  the  least  possible.    We  have  therefore 

unless  it  be  for  the  least  value  of  the  energy  consistent  with 
the  external  coordinates,  or  for  particular  values  of  the  ex- 
ternal coordinates.  But  the  value  of  any  term  of  this  equa- 
tion as  determined  for  particular  values  of  the  energy  and 
of  the  external  coordinates  is  not  distinguishable  from  its 
value  as  determined  for  values  of  the  energy  and  external 
coordinates  indefinitely  near  those  particular  values.  The 
equation  therefore  holds  without  limitation.  Multiplying 
by  «*,  we  get 

The  integral  of  this  equation  is 

2^*==^  +  *i>  (412) 

where  Ft  is  a  function  of  the  external  coordinates.  We  have 
an  equation  of  this  form  for  each  of  the  external  coordinates. 
This  gives,  with  (266),  for  the  complete  value  of  the  differen- 
tial of  V 

dV  =  6+de+(e+J^-F1)dal  +  (e+JJ,- F9)d<H  +  etc.,  (413) 

or 

dV=  e*{de  +  l^dox  +  l^da*  +  etc.)  —  Fx  da^  —  F%da%  —  etc. 

(414) 

To  determine  the  values  of  the  functions  Fx ,  F% ,  etc.,  let 

us  suppose  «! ,  Oj ,  etc.  to  vary  arbitrarily,  while  e  varies  so 

as  always  to  have  the  least  value  consistent  with  the  values 

of  the  external  coordinates.    This  will  make    V=  0,  and 

dV-=.  0.     If  n  <  2,  we  shall  have  also  e*  =  0,  which  will 

give 

Ji  =  0,    -Fi  =  0,    etc.  (416) 


Digitized  by  VjOOQIC 


128  THE  MICROCANONICAL  DISTRIBUTION. 

The  result  is  the  same  for  any  value  of  n.  For  in  the  varia- 
tions considered  the  kinetic  energy  will  be  constantly  zero, 
and  the  potential  energy  will  have  the  least  value  consistent 
with  the  external  coordinates.  The  condition  of  the  least 
possible  potential  energy  may  limit  the  ensemble  at  each  in- 
stant to  a  single  configuration,  or  it  may  not  do  so ;  but  in  any 
case  the  values  of  A1 ,  A^  etc.  will  be  the  same  at  each  instant 
for  all  the  systems  of  the  ensemble,*  and  the  equation 

de  +  Aidax  +  A%doug  +  etc.  =  0 

will  hold  for  the  variations  considered.  Hence  the  functions 
JPj ,  -Pa,  etc.  vanish  in  any  case,  and  we  have  the  equation 

d  V=  e+de  +  e+Z^dO!  +  e^J^dc^  +  etc.,  (416) 

de+lQ-ddi  +  3iLrfo,  +  etc. 

d  log  r  = e_,Fw ,        (4i7) 

or  again 

de  =  e~+  VdlogV-1^]tda1-'2d€daf  -  etc.         (418) 

It  will  be  observed  that  the  two  last  equations  have  the  form 
of  the  fundamental  differential  equations  of  thermodynamics, 
er+V  corresponding  to  temperature  and  log  V  to  entropy. 
We  have  already  observed  properties  of  «-♦  V  suggestive  of  an 
analogy  with  temperature,  f  The  significance  of  these  facts 
will  be  discussed  in  another  chapter. 
The  two  last  equations  might  be  written  more  simply 

,_      de  +  Aj\9  dOj  +  3^|,  da*  +  etc. 

de  =  e~*  d  P  —  "37|#  dat  —  jQ*  da%  —  etc., 

and  still  have  the  form  analogous  to  the  thermodynamic 
equations,  but  er+  has  nothing  like  the  analogies  with  tempera- 
ture which  we  have  observed  in  er+  V. 

*  This  statement,  as  mentioned  before,  may  hare  exceptions  for  particular 
rallies  of  the  external  coordinates.  This  will  not  invalidate  the  reasoning, 
which  has  to  do  with  varying  values  of  the  external  coordinates. 

t  See  Chapter  IX,  page  111 ;  also  this  chapter,  page  119. 
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MAXIMUM  AND  MINIMUM  PROPERTIES  OF  VARIOUS  DIS- 
TRIBUTIONS IN  PHA8E. 

In  the  following  theorems  we  suppose,  as  always,  that  the 
systems  forming  an  ensemble  are  identical  in  nature  and  in 
the  values  of  the  external  coordinates,  which  are  here  regarded 
as  constants* 

Theorem  L  If  an  ensemble  of  systems  is  so  distributed  in 
phase  that  the  index  of  probability  is  a  function  of  the  energy, 
the  average  value  of  the  index  is  less  than  for  any  other  distri- 
bution in  which  the  distribution  in  energy  is  unaltered* 

Let  us  write  q  for  the  index  which  is  a  function  of  the 
energy,  and  17  +  Aip  for  any  other  which  gives  the  same  dis- 
tribution in  energy.    It  is  to  be  proved  that 

401  401 


where  17  is  a  function  of  the  energy,  and  A*?  a  function  of  the 
phase,  which  are  subject  to  the  conditions  that 
401  an 

f.-..fe*AUp1...dqm=f...feUpl...dqm  =  lf  (420) 


and  that  for  any  value  of  the  energy  (/) 

J.  .  .fe+^Ufr  ...dq.czf..  .fjdfr  ...dq%.      (421) 

Equation  (420)  expresses  the  general  relations  which  17  and 
tf  +  A17  must  satisfy  in  order  to  be  indices  of  any  distributions, 
and  (421)  expresses  the  condition  that  they  give  the  same 
distribution  in  eneigy. 
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Since  17  is  a  function  of  the  energy,  and  may  therefore  be  re- 
garded as  a  constant  within  the  limits  of  integration  of  (421), 
we  may  multiply  by  17  under  the  integral  sign  in  both  mem- 
bers, which  gives 

/  •  •  •  /  ti^^dpx  ...dqu=  J  .  . .  I  yfdpx  . .  .  dqu. 

Since  this  is  true  within  the  limits  indicated,  and  for  every 
value  of  e\  it  will  be  true  if  the  integrals  are  taken  for  all 
phases.    We  may  therefore  cancel  the  corresponding  parts  of 
(419),  which  gives 
an 

f. .  .fbrieP^dpi . . .  dqu> 0.  (422) 

phatas 

But  by  (420)  this  is  equivalent  to 
•n 
f.  . .  f  (Ai?«**  +  1  -  ***)*&! .  • .  dqm  >  0.         (423) 


Now  A17  eP*  +  1  —  e**  is  a  decreasing  function  of  A17  for  nega- 
tive values  of  A*?,  and  an  increasing  function  of  A*?  for  positive 
values  of  A*?.  It  vanishes  for  A17  =  0.  The  expression  is 
therefore  incapable  of  a  negative  value,  and  can  have  the  value 
0  only  for  A17  =  0.  The  inequality  (428)  will  hold  therefore 
unless  A17  =  0  for  all  phases.  The  theorem  is  therefore 
proved. 

Theorem  IL  If  an  ensemble  of  systems  is  canonically  dis- 
tributed in  phase,  the  average  index  of  probability  is  less  than 
in  any  other  distribution  of  the  ensemble  having  the  same 
average  energy. 

For  the  canonical  distribution  let  the  index  be  (^r  —  «)/6, 
and  for  another  having  the  same  average  energy  let  the  index 
be  (^r  —  e)/6  +  A17,  where  A*?  is  an  arbitrary  function  of  the 
phase  subject  only  to  the  limitation  involved  in  the  notion  of 
the  index,  that 


Digitized  by  VjOOQIC 


MAXIMUM  AND  MINIMUM  PROPERTIES.         131 
an       A—  t  in       a  ^ 

J...J«e  dpl...dq%=J...je*  dpx . . .  d^.  =  l, 


(424) 
and  to  that  relating  to  the  constant  average  energy,  that 

all  f-4  all  ^_t 

/  •  •  •  J  €«  ®         dpx...dqn  as J  ...lee9  dfr  . . .  dq%.    (425) 


It  is  to  be  proved  that 

all  A    t 

pluatw 

all  ^_€ 

/•••/(•-J)*"5"*  •••*•■   <426> 

Now  in  virtue  of  the  first  condition  (424)  we  may  cancel  the 
constant  term  ^  /  6  in  the  parentheses  in  (426),  and  in  virtue 
of  the  second  condition  (425)  we  may  cancel  the  term  e/6. 
The  proposition  to  be  proved  is  thus  reduced  to 

all  +__* 

...  I  Arje  B         dpx . . .  dqn  >  0, 


which  may  be  written,  in  virtue  of  the  condition  (424), 

aU  f-* 

f. .  .f(*neLn  +  1  -  e*fl)e*~dp1  ...dq*>0.       (427) 

In  this  form  its  truth  is  evident  for  the  same  reasons  which 
applied  to  (428). 

Theorem  227.  If  6  is  any  positive  constant,  the  average 
value  in  an  ensemble  of  the  expression  t)  +  e/  6  (tf  denoting 
as  usual  the  index  of  probability  and  e  the  energy)  is  less  when 
the  ensemble  is  distributed  canonically  with  modulus  6,  than 
for  any  other  distribution  whatever. 

In  accordance  with  our  usual  notation  let  us  write 
(^r  —  e)  /6  for  the  index  of  the  canonical  distribution.  In  any 
other  distribution  let  the  index  be  (^r  —  c)/6  +  A17. 
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In  the  canonical  ensemble  i\  +  tj  6  has  the  constant  value 
yfr/S;  in  the  other  ensemble  it  has  the  value  ^r/B  +  A17. 
The  proposition  to  be  proved  may  therefore  be  written 


where 


plMMt 


all         4^5+ a  ^         t^ 

f...  Ce  B         dpl...dqm=f...fe  *  ^...^sL     (429) 


In  virtue  of  this  condition,  since  ^r/6  is  constant,  the  propo- 
sition to  be  proved  reduces  to 

0<f...f±i,ee  dqx...dpn,  (430) 

where  the  demonstration  may  be  concluded  as  in  the  last 
theorem. 

If  we  should  substitute  for  the  energy  in  the  preceding 
theorems  any  other  function  of  the  phase,  the  theorems,  mu- 
tatis mutandis,  would  still  hold.  On  account  of  the  unique 
importance  of  the  energy  as  a  function  of  the  phase,  the  theo- 
rems as  given  are  especially  worthy  of  notice.  When  the  case 
is  such  that  other  functions  of  the  phase  have  important 
properties  relating  to  statistical  equilibrium,  as  described 
in  Chapter  IV,*  the  three  following  theorems,  which  are 
generalizations  of  the  preceding,  may  be  useful.  It  will  be 
sufficient  to  give  them  without  demonstration,  as  the  principles 
involved  are  in  no  respect  different. 

Theorem  IV.  If  an  ensemble  of  systems  is  so  distributed  in 
phase  that  the  index  of  probability  is  any  function  of  Fv  Fv 
etc.,  (these  letters  denoting  functions  of  the  phase,)  the  average 
value  of  the  index  is  less  than  for  any  other  distribution  in 
phase  in  which  the  distribution  with  respect  to  the  functions 
Fv  Fv  etc.  is  unchanged. 

•  See  pages  87-41. 
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Theorem  V.  If  an  ensemble  of  systems  is  so  distributed 
in  phase  that  the  index  of  probability  is  a  linear  function  of 
Fv  Fv  etc.,  (these  letters  denoting  functions  of  the  phase,)  the 
average  value  of  the  index  is  less  than  for  any  other  distribu- 
tion in  which  the  functions  Fv  Fv  etc.  have  the  same  average 
values. 

Theorem  VI.  The  average  value  in  an  ensemble  of  systems 
of  rj  +  F  (where  t)  denotes  as  usual  the  index  of  probability  and 
.7  any  function  of  the  phase)  is  less  when  the  ensemble  is  so 
distributed  that  rj  +  Fib  constant  than  for  any  other  distribu- 
tion whatever. 

Theorem  VH.  If  a  system  which  in  its  different  phases 
constitutes  an  ensemble  consists  of  two  parts,  and  we  consider 
the  average  index  of  probability  for  the  whole  system,  and 
also  the  average  indices  for  each  of  the  parts  taken  separately, 
the  sum  of  the  average  indices  for  the  parts  will  be  either  less 
than  the  average  index  for  the  whole  system,  or  equal  to  it, 
but  cannot  be  greater.  The  limiting  case  of  equality  occurs 
when  the  distribution  in  phase  of  each  part  is  independent  of 
that  of  the  other,  and  only  in  this  case. 

Let  the  coordinates  and  momenta  of  the  whole  system  be 
Ji  •  •  •  y«»  JPi » •  •  •  P**  °f  which  jj . . .  qm  p^ , . .  .pm  relate  to  one 
part  of  the  system,  and  qm¥l , . . .  q*,  pm¥l , . .  .pn  to  the  other. 
If  the  index  of  probability  for  the  whole  system  is  denoted  by 
f?,  the  probability  that  the  phase  of  an  unspecified  system  lies 
within  any  given  limits  is  expressed  by  the  integral 

I  ...  I  fdpi .  . .  dqn  (431) 

taken  for  those  limits.    If  we  set 

J  "J  S*Pmn  •  •  •  <*Msw . . ,  <k*  =  «*i  (482) 

where  the  integrations  cover  all  phases  of  the  second  system, 
and 

f. .  .JV  dfc  . . .  dpmdqx  ...dqm  =  e*  (433) 
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where  the  integrations  cover  all  phases  of  the  first  system, 
the  integral  (481)  will  reduce  to  the  form 

J. .  . JV1  (fa...  dpmdqx  ...dq^  (434) 

when  the  limits  can  be  expressed  in  terms  of  the  coordinates 
and  momenta  of  the  first  part  of  the  system.  The  same  integral 
will  reduce  to 

J    .  .Je*  dp^x  ...dpn  dq^  ...dq„  (435) 

when  the  limits  can  be  expressed  in  terms  of  the  coordinates 
and  momenta  of  the  second  part  of  the  system*    It  is  evident 
thati/1  and  i?2  are  the  indices  of  probability  for  the  two  parts 
of  the  system  taken  separately. 
The  main  proposition  to  be  proved  may  be  written 

l...lril**dpl...dqm+f...j  it  a*  dpmVk  ...dqm< 

f..-fie'dpl...dqnf    (436) 

where  the  first  integral  is  to  be  taken  over  all  phases  of  the  first 
part  of  the  system,  the  second  integral  over  all  phases  of  the 
second  part  of  the  system,  and  the  last  integral  over  all  phases 
of  the  whole  system.    Now  we  have 

f...j*eUpl...dqn  =  l,  (437) 

C. . .  fe^dpt  ...dqm  =  lt  (438) 

and  J. .  .fe*dPm¥l . . .  dqn  =  1,  (439) 

where  the  limits  cover  in  each  case  all  the  phases  to  which  the 
variables  relate.  The  two  last  equations,  which  are  in  them- 
selves evident,  may  be  derived  by  partial  integration  from  the 
first. 
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It  appears  from  the  definitions  of  f)t  and  vt  that  (486)  may 
also  be  written 

f .  •  • fviftyi  •  •  •  <*Sfc  +J    •  -Jv**dPi  •  •  •  *•  £ 

f...fl*#i...Sb,    (440) 

or  J. . . Jfo  —  in  -  yje'dpi  ...«*&.£  0, 

where  the  integrations  coyer  all  phases.    Adding  the  equation 
f...J***>dpx...aq%  =  l9  (441) 

which  we  get  by  multiplying  (488)  and  (439),  and  subtract- 
ing (487),  we  have  for  the  proposition  to  be  proved 
•n 

f---f[b-  +  -nd*  +  ^-f}*i---**Z*-    (442) 


Let 

«*  =  i?  —  in  —  i?i-  (443) 

The  main  proposition  to  be  proved  may  be  written 
an 

f. .  -[(**  +  1  -  O******  •  •  •  <%*  £  °-  (444) 


This  is  evidently  true  since  the  quantity  in  the  parenthesis  is 
incapable  of  a  negative  value.*  Moreover  the  sign  =  can 
hold  only  when  the  quantity  in  the  parenthesis  vanishes  for 
all  phases,  t.  e.,  when  u  =  0  for  aU  phases.  This  makes 
rj  =  qt  +  q%  for  all  phases,  which  is  the  analytical  condition 
which  expresses  that  the  distributions  in  phase  of  the  two 
parts  of  the  system  are  independent. 

Theorem  VUL  If  two  or  more  ensembles  of  systems  which 
are  identical  in  nature,  but  may  be  distributed  differently  in 
phase,  are  united  to  form  a  single  ensemble,  so  that  the  prob- 
ability-coefficient of  the  resulting  ensemble  is  a  linear  function 

•  See  Theorem  I,  where  this  is  prored  of  a  similar  expression. 
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of  the  probability-coefficients  of  the  original  ensembles,  the 
average  index  of  probability  of  the  resulting  ensemble  cannot 
be  greater  than  the  same  linear  function  of  the  average  indices 
of  the  original  ensembles.  It  can  be  equal  to  it  only  when 
the  original  ensembles  are  similarly  distributed  in  phase. 

Let  P1  j  P% ,  etc.  be  the  probability-coefficients  of  the  original 
ensembles,  and  P  that  of  the  ensemble  formed  by  combining 
them ;  and  let  Nx ,  JV^ ,  etc*  be  the  numbers  of  systems  in  the 
original  ensembles.    It  is  evident  that  we  shall  have 

P  =  cxPx  +  ctP%  +  etc.  =  SfaPO,  (446) 

N  N 

where  *~%W'    *  =  SiP    et°*  ^*46^ 

The  main  proposition  to  be  proved  is  that 
all  an 


J ••-JPlogPdpl...dqm£ir<hf-*fpilogPldI>t...dqnl 
phaaas  L      phaaea  -I 

(447) 
an 

or       C...ft2(elPllogPl)--PlogP]dpl>..dqm>0.    (448) 


If  we  set 

Qi  =  Pi  log  Pi  -  Pi  log  P  -  Px  +  P 

Qx  will  be  positive,  except  when  it  vanishes  for  Px  =  P.  To 
prove  this,  we  may  regard  P1  and  P  as  any  positive  quantities. 
Then 

(SO,-**-** 

\dpx*)P~-px 

Since  Qx  and  dQJdP1  vanish  for  Pt  =  P,  and  the  second 
differential  coefficient  is  always  positive,  Q1  must  be  positive 
except  when  Px  =  P.  Therefore,  if  Q%y  etc.  have  similar 
definitions, 

a(*ifc)2.0.  (449) 
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Butsince  XidPJ^P 

and  S  ei  =  19 

*(*  Qd  =  *(<kI>ilogPd-PlogP.  (460) 

This  proves  (448),  and  shows  that  the  sign  =  will  hold  only 
when 

Pi  =  P,    Pi  =  P,    etc. 

for  all  phases,  t.  e.>  only  when  the  distribution  in  phase  of  the 
original  ensembles  are  all  identical. 

Theorem  IX.  A  uniform  distribution  of  a  given  number  of 
systems  within  given  limits  of  phase  gives  a  less  average  index 
of  probability  of  phase  than  any  other  distribution. 

Let  q  be  the  constant  index  of  the  uniform  distribution,  and 
17  +  Ai;  the  index  of  some  other  distribution.  Since  the  num- 
ber of  systems  within  the  given  limits  is  the  same  in  the  two 
distributions  we  have 

f . . .  /V1"*1  dpx . . .  rffr  a  J. . .  if  dpx . . .  dq%}     (461) 

where  the  integrations,  like  those  which  follow,  are  to  be 
taken  within  the  given  limits.  The  proposition  to  be  proved 
may  be  written 

j...j{ri^^)e^^dpl...dq%>j...Jri^dp1...dq%i  (462) 

or,  since  17  is  constant, 

J     •   J  0?  +  ^)^ndpx  ...dqn  >J.  .  .fr,dPl  ...dqm.      (463) 

In  (451)  also  we  may  cancel  the  constant  factor  e\  and  multiply 
by  the  constant  factor  (17  +  1).    This  gives 

J  •  •  -J  Of  +  l)«A,*>i  •  .  .  dqn  =J  .  .  -J  (v  +  1)  dpi .  •  •  dqn. 

The  subtraction  of  this  equation  will  not  alter  the  inequality 
to  be  proved,  which  may  therefore  be  written 

J  •  •  -J  (*V  -  l)«Alf  <*Pi  •  ••«*?»>/  ...J  -dpx...dq% 
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or  f. .  .f  (A?/*  -e*"+l)cfr1...dqn>  0.         (464) 

Since  the  parenthesis  in  this  expression  represents  a  positive 
value,  except  when  it  vanishes  for  At;  =  0,  the  integral  will 
be  positive  unless  A17  vanishes  everywhere  within  the  limits, 
which  would  make  the  difference  of  the  two  distributions 
vanish.    The  theorem  is  therefore  proved. 
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CHAPTER  XH. 

ON  THE  MOTION  OF  SYSTEMS  AND  ENSEMBLES  OF  SYS- 
TEMS THROUGH  LONG  PERIODS  OF  TIME. 

An  important  question  which  suggests  itself  in  regard  to  any 
case  of  dynamical  motion  is  whether  the  system  considered 
will  return  in  the  course  of  time  to  its  initial  phase,  or,  if  it 
will  not  return  exactly  to  that  phase,  whether  it  will  do  so  to 
any  required  degree  of  approximation  in  the  course  of  a  suffi- 
ciently long  time.  To  be  able  to  give  even  a  partial  answer 
to  such  questions,  we  must  know  something  in  regard  to  the 
dynamical  nature  of  the  system.  In  the  following  theorem, 
the  only  assumption  in  this  respect  is  such  as  we  have  found 
necessary  for  the  existence  of  the  canonical  distribution. 

If  we  imagine  an  ensemble  of  identical  systems  to  be 
distributed  with  a  uniform  density  throughout  any  finite 
extension-in-phase,  the  number  of  the  systems  which  leave 
the  extension-in-phase  and  will  not  return  to  it  in  the  course 
of  time  is  less  than  any  assignable  fraction  of  the  whole 
number;  provided,  that  the  total  extension-in-phase  for  the 
systems  considered  between  two  limiting  values  of  the  energy 
is  finite,  these  limiting  values  being  less  and  greater  respec- 
tively than  any  of  the  energies  of  the  first-mentioned  exten- 
sion-in-phase. 

To  prove  this,  we  observe  that  at  the  moment  which  we 
call  initial  the  systems  occupy  the  given  extension-in-phase. 
It  is  evident  that  some  systems  must  leave  the  extension 
immediately,  unless  all  remain  in  it  forever.  Those  systems 
which  leave  the  extension  at  the  first  instant,  we  shall  call 
ihe  front  of  the  ensemble.  It  will  be  convenient  to  speak  of 
this  front  as  generating  the  extension-in-phase  through  which  it 
passes  in  the  course  of  time,  as  in  geometry  a  surface  is  said  to 
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generate  the  volume  through  which  it  passes.  In  equal  times 
the  front  generates  equal  extensions  in  phase.  This  is  an 
immediate  consequence  of  the  principle  of  conservation  of 
extension-in-phase,  unless  indeed  we  prefer  to  consider  it  as 
a  slight  variation  in  the  expression  of  that  principle.  For  in 
two  equal  short  intervals  of  time  let  the  extensions  generated 
be  A  and  B.  (We  make  the  intervals  short  simply  to  avoid 
the  complications  in  the  enunciation  or  interpretation  of  the 
principle  which  would  arise  when  the  same  extension-in-phase 
is  generated  more  than  once  in  the  interval  considered.)  Now 
if  we  imagine  that  at  a  given  instant  systems  are  distributed 
throughout  the  extension  A,  it  is  evident  that  the  same 
systems  will  after  a  certain  time  occupy  the  extension  2?, 
which  is  therefore  equal  to  A  in  virtue  of  the  principle  cited. 
The  front  of  the  ensemble,  therefore,  goes  on  generating 
equal  extensions  in  equal  times.  But  these  extensions  are 
included  in  a  finite  extension,  viz.,  that  bounded  by  certain 
limiting  values  of  the  energy.  Sooner  or  later,  therefore, 
the  front  must  generate  phases  which  it  has  before  generated. 
Such  second  generation  of  the  same  phases  must  commence 
with  the  initial  phases.  Therefore  a  portion  at  least  of  the 
front  must  return  to  the  original  extension-in-phase.  The 
same  is  of  course  true  of  the  portion  of  the  ensemble  which 
follows  that  portion  of  the  front  through  the  same  phases  at 
a  later  time. 

It  remains  to  consider  how  large  the  portion  of  the  ensemble 
is,  which  will  return  to  the  original  extension-in-phase.  There 
can  be  no  portion  of  the  given  extension-in-phase,  the  systems 
of  which  leave  the  extension  and  do  not  return.  For  we  can 
prove  for  any  portion  of  the  extension  as  for  the  whole,  that 
at  least  a  portion  of  the  systems  leaving  it  will  return. 

We  may  divide  the  given  extension-in-phase  into  parts  as 
follows.  There  may  be  parts  such  that  the  systems  within 
them  will  never  pass  out  of  them.  These  parts  may  indeed 
constitute  the  whole  of  the  given  extension.  But  if  the  given 
extension  is  very  small,  these  parts  will  in  general  be  non- 
existent   There  may  be  parts  such  that  systems  within  them 
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will  all  pass  out  of  the  given  extension  and  all  return  within 
it  The  whole  of  the  given  extension-in-phase  is  made  up  of 
parts  of  these  two  kinds.  This  does  not  exclude  the  possi- 
bility of  phases  on  the  boundaries  of  such  parts,  such  that 
Systems  starting  with  those  phases  would  leave  the  extension 
and  never  return.  But  in  the  supposed  distribution  of  an 
ensemble  of  systems  with  a  uniform  density-in-phase,  such 
systems  would  not  constitute  any  assignable  fraction  of  the 
whole  number. 

These  distinctions  may  be  illustrated  by  a  very  simple 
example.  If  we  consider  the  motion  of  a  rigid  body  of 
which  one  point  is  fixed,  and  which  is  subject  to  no  forces, 
we  find  three  cases.  (1)  The  motion  is  periodic.  (2)  The 
system  will  never  return  to  its  original  phase,  but  will  return 
infinitely  near  to  it.  (8)  The  system  will  never  return  either 
exactly  or  approximately  to  its  original  phase.  But  if  we 
consider  any  extension-in-phase,  however  small,  a  system 
leaving  that  extension  will  return  to  it  except  in  the  case 
called  by  Poinsot  *  singular,'  viz.,  when  the  motion  is  a 
rotation  about  an  axis  lying  in  one  of  two  planes  having 
a  .fixed  position  relative  to  the  rigid  body.  But  all  such 
phases  do  not  constitute  any  true  exUnnon-inrphase  in  the 
sense  in  which  we  have  defined  and  used  the  term.* 

In  the  same  way  it  may  be  proved  that  the  systems  in  a 
canonical  ensemble  which  at  a  given  instant  are  contained 
within  any  finite  extension-in-phase  will  in  general  return  to 

•  An  ensemble  of  systems  distributed  in  phase  is  a  less  simple  and  ele- 
mentary conception  than  a  single  system.  But  by  the  consideration  of 
suitable  ensembles  instead  of  single  systems,  we  may  get  rid  of  the  incon- 
Tenience  of  haying  to  consider  exceptions  formed  by  particular  cases  of  the 
integral  equations  of  motion,  these  cases  simply  disappearing  when  the 
ensemble  is  substituted  for  the  single  system  as  a  subject  of  study.  This 
is  especially  true  when  the  ensemble  is  distributed,  as  in  the  case  called 
canonical,  throughout  an  extension-in-phase.  In  a  less  degree  it  is  true  of 
the  microcanonical  ensemble,  which  does  not  occupy  any  extension-in-phase, 
(in  the  sense  in  which  we  hare  used  the  term,)  although  it  is  conrenient  to 
regard  it  as  a  limiting  case  with  respect  to  ensembles  which  do,  as  we  thus 
gain  for  the  subject  some  part  of  the  analytical  simplicity  which  belongs  to 
the  theory  of  ensembles  which  occupy  true  extensions-in-phase. 
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that  extension-in-phase,  if  they  leave  it,  the  exceptions,  t.  *., 
the  number  which  pass  out  of  the  extension-in-phase  and  do 
not  return  to  it,  being  less  than  any  assignable  fraction  of  the 
whole  number.  In  other  words,  the  probability  that  a  system 
taken  at  random  from  the  part  of  a  canonical  ensemble  which 
is  contained  within  any  given  extension-in-phase,  will  pass  out 
of  that  extension  and  not  return  to  it,  is  zero. 

A  similar  theorem  may  be  enunciated  with  respect  to  a 
microcanonical  ensemble.  Let  us  consider  the  fractional  part 
of  such  an  ensemble  which  lies  within  any  given  limits  of 
phase.  This  fraction  we  shall  denote  by  F.  It  is  evidently 
constant  in  time  since  the  ensemble  is  in  statistical  equi- 
librium. The  systems  within  the  limits  will  not  in  general 
remain  the  same,  but  some  will  pass  out  in  each  unit  of  time 
while  an  equal  number  come  in.  Some  may  pass  out  never 
to  return  within  the  limits.  But  the  number  which  in  any 
time  however  long  pass  out  of  the  limits  never  to  return  will 
not  bear  any  finite  ratio  to  the  number  within  the  limits  at 
a  given  instant.  For,  if  it  were  otherwise,  let  /  denote  the 
fraction  representing  such  ratio  for  the  time  T.  Then,  in 
the  time  7,  the  number  which  pass  out  never  to  return  will 
bear  the  ratio  /  F  to  the  whole  number  in  the  ensemble,  and 
in  a  time  exceeding  T/(fF)  the  number  which  pass  out  of 
the  limits  never  to  return  would  exceed  the  total  number 
of  systems  in  the  ensemble.  The  proposition  is  therefore 
proved. 

This  proof  will  apply  to  the  cases  before  considered,  and 
may  be  regarded  as  more  simple  than  that  which  was  given. 
It  may  also  be  applied  to  any  true  case  of  statistical  equilib- 
rium. By  a  true  case  of  statistical  equilibrium  is  meant  such 
as  may  be  described  by  giving  the  general  value  of  the  prob- 
ability that  an  unspecified  system  of  the  ensemble  is  con- 
tained within  any  given  limits  of  phase.* 

*  An  ensemble  in  which  the  systems  are  material  points  constrained  to 
more  in  rertical  circles,  with  just  enough  energy  to  carry  them  to  the 
highest  points,  cannot  afford  a  true  example  of  statistical  equilibrium.  For 
any  other  ralue  of  the  energy  than  the  critical  ralue  mentioned,  we  might 
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Let  us  next  consider  whether  an  ensemble  of  isolated 
systems  has  any  tendency  in  the  courr.  of  time  toward  a 
state  of  statistical  equilibrium* 

There  are  certain  functions  of  phase  which  are  constant  in 
time.  The  distribution  of  the  ensemble  with  respect  to  the 
values  of  these  functions  is  necessarily  invariable,  that  is, 
the  number  of  systems  within  any  limits  which  can  be 
specified  in  terms  of  these  functions  cannot  vary  in  the  course 
of  time.  The  distribution  in  phase  which  without  violating 
this  condition  gives  the  least  value  of  the  average  index  of 
probability  of  phase  (rj)  is  unique,  and  is  that  in  which  the 

In  various  ways  describe  an  ensemble  in  statistical  equilibrium,  while  the 
same  language  applied  to  the  critical  ralue  of  the  energy  would  fail  to  do 
so.  Thus,  if  we  should  say  that  the  ensemble  is  so  distributed  that  the 
probability  that  a  system  is  in  any  giren  part  of  the  circle  is  proportioned 
to  the  time  which  a  single  system  spends  in  that  part,  motion  in  either  direc- 
tion being  equally  probable,  we  should  perfectly  define  a  distribution  in  sta- 
tistical equilibrium  for  any  ralue  of  the  energy  except  the  critical  ralue 
mentioned  abore,  but  for  this  ralue  of  the  energy  all  the  probabilities  in 
question  would  ranish  unless  the  highest  point  is  included  in  the  part  of  the 
.circle  considered,  in  which  case  the  probability  is  unity,  or  forms  one  of  its 
limits,  in  which  case  the  probability  is  indeterminate.  Compare  the  foot-note 
on  page  118. 

A  still  more  simple  example  is  afforded  by  the  uniform  motion  of  a 
material  point  in  a  straight  line.  Here  the  impossibility  of  statistical  equi- 
librium is  not  limited  to  any  particular  energy,  and  the  canonical  distribu- 
tion as  weU  as  the  microcanonical  is  impossible. 

These  examples  are  mentioned  here  in  order  to  show  the  necessity  of 
caution  in  the  application  of  the  abore  principle,  with  respect  to  the  question 
whether  we  hare  to  do  with  a  true  case  of  statistical  equilibrium. 

Another  point  in  respect  to  which  caution  must  be  exercised  is  that  the 
part  of  an  ensemble  of  which  the  theorem  of  the  return  of  systems  is  asserted 
should  be  entirely  defined  by  limits  within  which  it  is  contained,  and  not  by 
any  such  condition  as  that  a  certain  function  of  phase  shall  hare  a  giren 
ralue.  This  is  necessary  in  order  that  the  part  of  the  ensemble  which  is 
considered  should  be  any  assignable  fraction  of  the  whole.  Thus,  if  we  hare 
a  canonical  ensemble  consisting  of  material  points  in  rertical  circles,  the 
theorem  of  the  return  of  systems  may  be  applied  to  a  part  of  the  ensemble 
defined  as  contained  in  a  giren  part  of  the  circle.  But  it  may  not  be  applied 
in  all  cases  to  a  part  of  the  ensemble  defined  as  contained  in  a  giren  part 
of  the  circle  and  haying  a  giren  energy.  It  would,  in  fact,  express  the  exact 
opposite  of  the  truth  when  the  giren  energy  is  the  critical  ralue  mentioned 
abore. 
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index  of  probability  (rf)  is  a  function  of  the  functions  men- 
tioned.* It  is  therefore  a  permanent  distribution,  f  and  the 
only  permanent  distribution  consistent  with  the  invariability 
of  the  distribution  with  respect  to  the  functions  of  phase 
which  are  constant  in  time. 

It  would  seem,  therefore,  that  we  might  find  a  sort  of  meas- 
ure of  the  deviation  of  an  ensemble  from  statistical  equilibrium 
in  the  excess  of  the  average  index  above  the  minimum  which  is 
consistent  with  the  condition  of  the  invariability  of  the  distri- 
bution with  respect  to  the  constant  functions  of  phase.  But 
we  have  seen  that  the  index  of  probability  is  constant  in  time 
for  each  system  of  the  ensemble.  The  average  index  is  there- 
fore constant,  and  we  find  by  this  method  no  approach  toward 
statistical  equilibrium  in  the  course  of  time. 

Yet  we  must  here  exercise  great  caution.  One  function 
may  approach  indefinitely  near  to  another  function,  while* 
some  quantity  determined  by  the  first  does  not  approach  the 
corresponding  quantity  determined  by  the  second.  A  line 
joining  two  points  may  approach  indefinitely  near  to  the 
straight  line  joining  them,  while  its  length  remains  constant. 
We  may  find  a  closer  analogy  with  the  case  under  considera- 
tion in  the  effect  of  stirring  an  incompressible  liquid.^  In 
space  of  2n  dimensions  the  case  might  be  made  analyti- 
cally identical  with  that  of  an  ensemble  of  systems  of  n 
degrees  of  freedom,  but  the  analogy  is  perfect  in  ordinary 
space.  Let  us  suppose  the  liquid  to  contain  a  certain  amount 
of  coloring  matter  which  does  not  affect  its  hydrodynamic 
properties.  Now  the  state  in  which  the  density  of  the  coloring 
matter  is  uniform,  i.  e.,  the  state  of  perfect  mixture,  which  is 
a  sort  of  state  of  equilibrium  in  this  respect  that  the  distribu- 
tion of  the  coloring  matter  in  space  is  not  affected  by  the 
internal  motions  of  the  liquid,  is  characterized  by  a  minimum 

*  See  Chapter  XI,  Theorem  IV. 

t  See  Chapter  IV,  $ub  init, 

t  By  liquid  is  here  meant  the  continuous  body  of  theoretical  hydrody- 
namics, and  not  anything  of  the  molecular  structure  and  molecular  motions 
of  real  liquids. 
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value  of  the  average  square  of  the  density  of  the  coloring 
matter.  Let  us  suppose,  however,  that  the  coloring  matter  is 
distributed  with  a  variable  density.  If  we  give  the  liquid  any 
motion  whatever,  subject  only  to  the  hydrodynamic  law  of 
incompressibility,  —  it  may  be  a  steady  flux,  or  it  may  vary 
with  the  time, — the  density  of  the  coloring  matter  at  any 
same  point  of  the  liquid  will  be  unchanged,  and  the  average 
square  of  this  density  will  therefore  be  unchanged.  Yet  no 
fact  is  more  familiar  to  us  than  that  stirring  tends  to  bring  a 
liquid  to  a  state  of  uniform  mixture,  or  uniform  densities  of 
its  components,  which  is  characterized  by  minimum  values 
of  the  average  squares  of  these  densities.  It  is  quite  true  that 
in  the  physical  experiment  the  result  is  hastened  by  the 
process  of  diffusion,  but  the  result  is  evidently  not  dependent 
on  that  process. 

The  contradiction  is  to  be  traced  to  the  notion  of  the  dentity 
of  the  coloring  matter,  and  the  process  by  which  this  quantity 
is  evaluated.  This  quantity  is  the  limiting  ratio  of  the 
quantity  of  the  coloring  matter  in  an  element  of  space  to  the 
volume  of  that  element.  Now  if  we  should  take  for  our  ele- 
ments of  volume,  after  any  amount  of  stirring,  the  spaces 
occupied  by  the  same  portions  of  the  liquid  which  originally 
occupied  any  given  system  of  elements  of  volume,  the  densi- 
ties of  the  coloring  matter,  thus  estimated,  would  be  identical 
with  the  original  densities  as  determined  by  the  given  system 
of  elements  of  volume.  Moreover,  if  at  the  end  of  any  finite 
amount  of  stirring  we  should  take  our  elements  of  volume  in 
any  ordinary  form  but  sufficiently  small,  the  average  square 
of  the  density  of  the  coloring  matter,  as  determined  by  such 
element  of  volume,  would  approximate  to  any  required  degree 
to  its  value  before  the  stirring.  But  if  we  take  any  element 
of  space  of  fixed  position  and  dimensions,  we  may  continue 
the  stirring  so  long  that  the  densities  of  the  colored  liquid 
estimated  for  these  fixed  elements  will  approach  a  uniform 
limit,  viz.,  that  of  perfect  mixture. 

The  case  is  evidently  one  of  those  in  which  the  limit  of  a 
limit  has  different  values,  according  to  the  order  in  which  we 
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apply  the  processes  of  taking  a  limit.  If  treating  the  elements 
of  volume  as  constant,  we  continue  the  stirring  indefinitely, 
we  get  a  uniform  density,  a  result  not  affected  by  making  the 
elements  as  small  as  we  choose ;  but  if  treating  the  amount  of 
stirring  as  finite,  we  diminish  indefinitely  the  elements  of 
volume,  we  get  exactly  the  same  distribution  in  density  as 
before  the  stirring,  a  result  which  is  not  affected  by  con- 
tinuing the  stirring  as  long  as  we  choose.  The  question  is 
largely  one  of  language  and  definition.  One  may  perhaps  be 
allowed  to  say  that  a  finite  amount  of  stirring  will  not  affect 
the  mean  square  of  the  density  of  the  coloring  matter,  but  an 
infinite  amount  of  stirring  may  be  regarded  as  producing  a 
condition  in  which  the  mean  square  of  the  density  has  its 
minimum  value,  and  the  density  is  uniform.  We  may  cer- 
tainly say  that  a  sensibly  uniform  density  of  the  colored  com- 
ponent may  be  produced  by  stirring.  Whether  the  time 
required  for  this  result  would  be  long  or  short  depends  upon 
the  nature  of  the  motion  given  to  the  liquid,  and  the  fineness 
of  our  method  of  evaluating  the  density. 

All  this  may  appear  more  distinctly  if  we  consider  a  special 
case  of  liquid  motion.  Let  us  imagine  a  cylindrical  mass  of 
liquid  of  which  one  sector  of  90°  is  black  and  the  rest  white. 
Let  it  have  a  motion  of  rotation  about  the  axis  of  the  cylinder 
in  which  the  angular  velocity  is  a  function  of  the  distance 
from  the  axis.  In  the  course  of  time  the  black  and  the  white 
parts  would  become  drawn  out  into  thin  ribbons,  which  would 
be  wound  spirally  about  the  axis.  The  thickness  of  these  rib- 
bons would  diminish  without  limit,  and  the  liquid  would  there- 
fore tend  toward  a  state  of  perfect  mixture  of  the  black  and 
white  portions.  That  is,  in  any  given  element  of  space,  the 
proportion  of  the  black  and  white  would  approach  1 : 3  as  a  limit. 
Yet  after  any  finite  time,  the  total  volume  would  be  divided 
into  two  parts,  one  of  which  would  consist  of  the  white  liquid 
exclusively,  and  the  other  of  the  black  exclusively.  If  the 
coloring  matter,  instead  of  being  distributed  initially  with  a 
uniform  density  throughout  a  section  of  the  cylinder,  were 
distributed  with  a  density  represented  by  any  arbitrary  f unc- 
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tion  of  the  cylindrical  coordinates  r,  6  and  z,  the  effect  of  the 
same  motion  continued  indefinitely  would  be  an  approach  to 
a  condition  in  which  the  density  is  a  function  of  r  and  z  alone. 
In  this  limiting  condition,  the  average  square  of  the  density 
would  be  less  than  in  the  original  condition,  when  the  density 
was  supposed  to  vary  with  <?,  although  after  any  finite  time 
the  average  square  of  the  density  would  be  the  same  as  at 
first 

If  we  limit  our  attention  to  the  motion  in  a  single  plane 
perpendicular  to  the  axis  of  the  cylinder,  we  have  something 
which  is  almost  identical  with  a  diagrammatic  representation 
of  the  changes  in  distribution  in  phase  of  an  ensemble  of 
systems  of  one  degree  of  freedom,  in  which  the  motion  is 
periodic,  the  period  varying  with  the  energy,  as  in  the  case  of 
a  pendulum  swinging  in  a  circular  arc.  If  the  coordinates 
and  momenta  of  the  systems  are  represented  by  rectangu- 
lar coordinates  in  the  diagram,  the  points  in  the  diagram 
representing  the  changing  phases  of  moving  systems,  will 
move  about  the  origin  in  closed  curves  of  constant  energy. 
The  motion  will  be  such  that  areas  bounded  by  points  repre- 
senting moving  systems  will  be  preserved.  The  only  differ- 
ence between  the  motion  of  the  liquid  and  the  motion  in  the 
diagram  is  that  in  one  case  the  paths  are  circular,  and  in  the 
other  they  differ  more  or  less  from  that  form. 

When  the  energy  is  proportional  to  p*  +  <f  the  curves  of 
constant  energy  are  circles,  and  the  period  is  independent  of 
the  energy.  There  is  then  no  tendency  toward  a  state  of  sta- 
tistical equilibrium.  The  diagram  turns  about  the  origin  with- 
out change  of  form.  This  corresponds  to  the  case  of  liquid 
motion,  when  the  liquid  revolves  with  a  uniform  angular 
velocity  like  a  rigid  solid. 

The  analogy  between  the  motion  of  an  ensemble  of  systems 
in  an  extension-in-phase  and  a  steady  current  in  an  incompres- 
sible liquid,  and  the  diagrammatic  representation  of  the  case 
of  one  degree  of  freedom,  which  appeals  to  our  geometrical  in- 
tuitions, may  be  sufficient  to  show  how  the  conservation  of 
density  in  phase,  which  involves  the  conservation  of  the 
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average  value  of  the  index  of  probability  of  phase,  is  consist- 
ent with  an  approach  to  a  limiting  condition  in  which  that 
average  value  is  less.  We  might  perhaps  fairly  infer  from 
such  considerations  as  have  been  adduced  that  an  approach 
to  a  limiting  condition  of  statistical  equilibrium  is  the  general 
rule,  when  the  initial  condition  is  not  of  that  character.  But 
the  subject  is  of  such  importance  that  it  seems  desirable  to 
give  it  farther  consideration. 

Let  us  suppose  that  the  total  extension-in-phase  for  the 
kipd  of  system  considered  to  be  divided  into  equal  elements 
(2)F")  which  are  very  small  but  not  infinitely  small.  Let  us 
imagine  an  ensemble  of  systems  distributed  in  this  extension 
in  a  manner  represented  by  the  index  of  probability  17,  which 
is  an  arbitrary  function  of  the  phase  subject  only  to  the  re- 
striction expressed  by  equation  (46)  of  Chapter  I.  We  shall 
suppose  the  elements  DVto  be  so  small  that  q  may  in  gen- 
eral be  regarded  as  sensibly  constant  within  any  one  of  them 
at  the  initial  moment.  Let  the  path  of  a  system  be  defined  as 
the  series  of  phases  through  which  it  passes. 

At  the  initial  moment  (tT)  a  certain  system  is  in  an  element 
of  extension  DV.  Subsequently,  at  the  time  £",  the  same 
system  is  in  the  element  D  V,f.  Other  systems  which  were 
at  first  in  DV  will  at  the  time  *"  be  in  2>F",  but  not  all, 
probably.  The  systems  which  were  at  first  in  BV  wiH  at 
the  time  tn  occupy  an  extension-in-phase  exactly  as  laTge  as  at 
first.  But  it  will  probably  be  distributed  among  a  very  great 
number  of  the  elements  (-DJO  into  which  we  have  divided 
the  total  extension-in-phase.  If  it  is  not  so,  we  can  generally 
take  a  later  time  at  which  it  will  be  so.  There  will  be  excep- 
tions to  this  for  particular  laws  of  motion,  but  we  will  con- 
fine ourselves  to  what  may  fairly  be  called  the  general  case. 
Only  a  very  small  part  of  the  systems  initially  in  D  V1  will 
be  found  in  DV"  at  the  time  t",  and  those  which  are  found  in 
DVn  at  that  time  were  at  the  initial  moment  distributed 
among  a  very  large  number  of  elements  D  V. 

What  is  important  for  our  purpose  is  the  value  of  77,  the 
index  of  probability  of  phase  in  the  element  DVff  at  the  time 
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If9.     In  the  part  of  DV"  occupied  by  systems  which  at  the 
time  H  were  in  DV9  the  value  of  v  will  be  the  same  as  its 
value  in  DV  at  the  time  f,  which  we  shall  call  1/.     In  the 
parts  of  DV99  occupied  by  systems  which  at  f  were  in  ele- 
ments very  near  to  D  V  we  may  suppose  the  value  of  17  to 
vary  little  from  rf.    We  cannot  assume  this  in  regard  to  parts 
of  DV"  occupied  by  systems  which  at  f  were  in  elements 
remote  from  DV.     We  want,  therefore,  some  idea  of  the 
nature  of  the  extension-in-phase  occupied  at  H  by  the  sys- 
tems which  at  t"  will  occupy  D  V9.     Analytically,  the  prob- 
lem is  identical  with  finding  the  extension  occupied  at  t" 
by  the  systems  which  at  If  occupied  DV1.     Now  the  systems 
in  DV"  which  lie  on  the  same  path  as  the  system  first  con- 
sidered, evidently  arrived  at  DV"  at  nearly  the  same  time, 
and  must  have  left  D  V  at  nearly  the  same  time,  and  there- 
fore at  if  were  in  or  near  D  V.     We  may  therefore  take  V  as 
the  value  for  these  systems.    The  same  essentially  is  true  of 
systems  in  DV11  which  lie  on  paths  very  close  to  the  path 
already  considered.    But  with  respect  to  paths  passing  through 
DV  and  DV9,  but  not  so  close  to  the  first  path,  we  cannot 
assume  that  the  time  required  to  pass  from  DV  to  D V"  is 
nearly  the  same  as  for  the  first  path.    The  difference  of  the 
times  required  may  be  small  in  comparison  with  <"-*',  but  as 
this  interval  can  be  as  large  as  we  choose,  the  difference  of  the 
times  required  in  the  different  paths  has  no  limit  w  its  pos- 
sible value.    Now  if  the  case  were  one  of  statistical  equilib- 
rium, the  value  of  1/  would  be  constant  in  any  path,  and  if  all 
the  paths  which  pass  through  DVf  also  pass  through  or  near 
DV\  the  value  of  y  throughout  DV"  will  vary  little  from 
tf'.    But  when  the  case  is  not  one  of  statistical  equilibrium, 
we  cannot  draw  any  such  conclusion.    The  only  conclusion 
which  we  can  draw  with  respect  to  the  phase  at  t9  of  the  sys- 
tems which  at  t99  are  in  D  V1  is  that  they  are  nearly  on  the 
same  path. 

Now  if  we  should  make  a  new  estimate  of  indices  of  prob- 
ability of  phase  at  the  time  *",  using  for  this  purpose  the 
elements  DV,  —  that  is,  if  we  should  divide  the  number  of 
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systems  in  D  V",  for  example,  by  the  total  number  of  systems, 
and  also  by  the  extension-in-phase  of  the  element,  and  take 
the  logarithm  of  the  quotient,  we  would  get  a  number  which 
would  be  less  than  the  average  value  of  17  for  the  systems 
within  D  V"  based  on  the  distribution  in  phase  at  the  time  f.* 
Hence  the  average  value  of  17  for  the  whole  ensemble  of 
systems  based  on  the  distribution  at  t"  will  be  'less  than  the 
average  value  based  on  the  distribution  at  t\ 

We  must  not  forget  that  there  are  exceptions  to  this  gen- 
eral rule.  These  exceptions  are  in  cases  in  which  the  laws 
of  motion  are  such  that  systems  having  small  differences 
of  phase  will  continue  always  to  have  small  differences  of 
phase. 

It  is  to  be  observed  that  if  the  average  index  of  probability  in 
an  ensemble  may  be  said  in  some  sense  to  have  a  less  value  at 
one  time  than  at  another,  it  is  not  necessarily  priority  in  time 
which  determines  the  greater  average  index.  If  &  distribution, 
which  is  not  one  of  statistical  equilibrium,  should  be  given 
for  a  time  t\  and  the  distribution  at  an  earlier  time  t"  should 
be  defined  as  that  given  by  the  corresponding  phases,  if  we 
increase  the  interval  leaving  t!  fixed  and  taking  t,f  at  an  earlier 
and  earlier  date,  the  distribution  at  tn  will  in  general  approach 
a  limiting  distribution  which  is  in  statistical  equilibrium.  The 
determining  difference  in  such  cases  is  that  between  a  definite 
distribution  at  a  definite  time  and  the  limit  of  a  varying  dis- 
tribution when  the  moment  considered  is  carried  either  forward 
or  backward  indefinitely.! 

But  while  the  distinction  of  prior  and  subsequent  events 
may  be  immaterial  with  respect  to  mathematical  fictions,  it  is 
quite  otherwise  with  respect  to  the  events  of  the  real  world. 
It  should  not  be  forgotten,  when  our  ensembles  are  chosen  to 
illustrate  the  probabilities  of  events  in  the  real  world,  that 

*  See  Chapter  XI,  Theorem  IX. 

t  One  may  compare  the  kinematical  truism  that  when  two  points  are 
moving  with  uniform  velocities,  (with  the  single  exception  of  the  case  where 
the  relative  motion  is  zero,)  their  mutual  distance  at  any  definite  time  is  less 
than  f  or  t  =  00 ,  or  t  =  —  00 . 
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while  the  probabilities  of  subsequent  events  may  often  be 
determined  from  the  probabilities  of  prior  events,  it  is  rarely 
the  case  that  probabilities  of  prior  events  can  be  determined 
from  those  of  subsequent  events,  for  we  are  rarely  justified  in 
excluding  the  consideration  of  the  antecedent  probability  of 
the  prior  events. 

It  is  worthy  of  notice  that  to  take  a  system  at  random  from 
an  ensemble  at  a  date  chosen  at  random  from  several  given 
dates,  t',  t",  etc.,  is  practically  the  same  thing  as  to  take  a  sys- 
tem at  random  from  the  ensemble  composed  of  all  the  systems 
of  the  given  ensemble  in  their  phases  at  the  time  t!,  together 
with  the  same  systems  in  their  phases  at  the  time  t",  etc.  By 
Theorem  VIII  of  Chapter  XI  this  will  give  an  ensemble  in 
which  the  average  index  of  probability  will  be  less  than  in 
the  given  ensemble,  except  in  the  case  when  the  distribution 
in  the  given  ensemble  is  the  same  at  the  times  t',  t",  etc. 
Consequently,  any  indefiniteness  in  the  time  in  which  we  take 
a  system  at  random  from  an  ensemble  has  the  practical  effect 
of  diminishing  the  average  index  of  the  ensemble  from  which 
the  system  may  be  supposed  to  be  drawn,  except  when  the 
given  ensemble  is  in  statistical  equilibrium. 
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CHAPTER  Xm. 

EFFECT   OF  VARIOUS   PROCESSES  ON  AN  ENSEMBLE  OF 

SYSTEMS. 

In  the  last  chapter  and  in  Chapter  I  we  have  considered  the 
changes  which  take  place  in  the  course  of  time  in  an  ensemble 
of  isolated  systems.  Let  us  now  proceed  to  consider  the 
changes  which  will  take  place  in  an  ensemble  of  systems  under 
external  influences.  These  external  influences  will  be  of  two 
kinds,  the  variation  of  the  coordinates  which  we  have  called 
external,  and  the  action  of  other  ensembles  of  systems.  The 
essential  difference  of  the  two  kinds  of  influence  consists  in 
this,  that  the  bodies  to  which  the  external  coordinates  relate 
are  not  distributed  in  phase,  while  in  the  case  of  interaction 
of  the  systems  of  two  ensembles,  we  have  to  regard  the  fact 
that  both  are  distributed  in  phase.  To  find  the  effect  pro- 
duced on  the  ensemble  with  which  we  are  principally  con- 
cerned, we  have  therefore  to  consider  single  values  of  what 
we  have  called  external  coordinates,  but  an  infinity  of  values 
of  the  internal  coordinates  of  any  other  ensemble  with  which 
there  is  interaction. 

Or,  —  to  regard  the  subject  from  another  point  of  view,  — 
the  action  between  an  unspecified  system  of  an  ensemble  and 
the  bodies  represented  by  the  external  coordinates,  is  the 
action  between  a  system  imperfectly  determined  with  respect 
to  phase  and  one  which  is  perfectly  determined;  while  the 
interaction  between  two  unspecified  systems  belonging  to 
different  ensembles  is  the  action  between  two  systems  both  of 
which  are  imperfectly  determined  with  respect  to  phase.* 

We  shall  suppose  the  ensembles  which  we  consider  to  be 
distributed  in  phase  in  the  manner  described  in  Chapter  I,  and 

*  In  the  development  of  the  subject,  we  shall  find  that  this  distinction 
corresponds  to  the  distinction  in  thermodynamics  between  mechanical  and 
thermal  action. 
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represented  by  the  notations  of  that  chapter,  especially  by  the 
index  of  probability  of  phase  (17).  There  are  therefore  2  n 
independent  variations  in  the  phases  which  constitute  the 
ensembles  considered.  This  excludes  ensembles  like  the 
microcanonical,  in  which,  as  energy  is  constant,  there  are 
only  2n  — 1  independent  variations  of  phase.  This  seems 
necessary  for  the  purposes  of  a  general  discussion.  For 
although  we  may  imagine  a  microcanonical  ensemble  to  have 
a  permanent  existence  when  isolated  from  external  influences, 
the  effect  of  such  influences  would  generally  be  to  destroy  the 
uniformity  of  energy  in  the  ensemble.  Moreover,  since  the 
microcanonical  ensemble  may  be  regarded  as  a  limiting  case  of 
such  ensembles  as  are  described  in  Chapter  I,  (and  that  in 
more  than  one  way,  as  shown  in  Chapter  X,)  the  exclusion  is 
rather  formal  than  real,  since  any  properties  which  belong  to 
the  microcanonical  ensemble  could  easily  be  derived  from  those 
of  the  ensembles  of  Chapter  I,  which  in  a  certain  sense  may 
be  regarded  as  representing  the  general  case. 

Let  us  first  consider  the  effect  of  variation  of  the  external 
coordinates.  We  have  already  had  occasion  to  regard  these 
quantities  as  variable  in  the  differentiation  of  certain  equations 
relating  to  ensembles  distributed  according  to  certain  laws 
called  canonical  or  microcanonical.  That  variation  of  the 
external  coordinates  was,  however,  only  carrying  the  atten- 
tion of  the  mind  from  an  ensemble  with  certain  values  of  the 
external  coordinates,  and  distributed  in  phase  according  to 
some  general  law  depending  upon  those  values,  to  another 
ensemble  with  different  values  of  the  external  coordinates,  and 
with  the  distribution  changed  to  conform  to  these  new  values. 

What  we  have  now  to  consider  is  the  effect  which  would 
actually  result  in  the  course  of  time  in  an  ensemble  of  systems 
in  which  the  external  coordinates  should  be  varied  in  any 
arbitrary  manner.  Let  us  suppose,  in  the  first  place,  that 
these  coordinates  are  varied  abruptly  at  a  given  instant,  being 
constant  both  before  and  after  that  instant.  By  the  definition 
of  the  external  coordinates  it  appears  that  this  variation  does 
not  affect  the  phase  of  any  system  of  the  ensemble  at  the  time 
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when  it  takes  place.  Therefore  it  does  not  affect  the  index  of 
probability  of  phase  (17)  of  any  system,  or  the  average  value 
of  the  index  (rj)  at  that  time.  And  if  these  quantities  are 
constant  in  time  before  the  variation  of  the  external  coordi- 
nates, and  after  that  variation,  their  constancy  in  time  is  not 
interrupted  by  that  variation.  In  feet,  in  the  demonstration 
of  the  conservation  of  probability  of  phase  in  Chapter  I,  the 
variation  of  the  external  coordinates  was  not  excluded. 

But  a  variation  of  the  external  coordinates  will  in  general 
disturb  a  previously  existing  state  of  statistical  equilibrium. 
For,  although  it  does  not  affect  (at  the  first  instant)  the 
distribution-in-phase,  it  does  affect  the  condition  necessary  for 
equilibrium.  This  condition,  as  we  have  seen  in  Chapter  IV, 
is  that  the  index  of  probability  of  phase  shall  be  a  function  of 
phase  which  is  constant  in  time  for  moving  systems.  Now 
a  change  in  the  external  coordinates,  by  changing  the  forces 
which  act  on  the  systems,  will  change  the  nature  of  the 
functions  of  phase  which  are  constant  in  time.  Therefore, 
the  distribution  in  phase  which  was  one  of  statistical  equi- 
librium for  the  old  values  of  the  external  coordinates,  will  not 
be  such  for  the  new  values. 

Now  we  have  seen,  in  the  last  chapter,  that  when  the  dis- 
tribution-in-phase  is  not  one  of  statistical  equilibrium,  an 
ensemble  of  systems  may,  and  in  general  will,  after  a  longer  or 
shorter  time,  come  to  a  state  which  may  be  regarded,  if  very 
small  differences  of  phase  are  neglected,  as  one  of  statistical 
equilibrium,  and  in  which  consequently  the  average  value  of 
the  index  (ji)  is  less  than  at  first  It  is  evident,  therefore, 
that  a  variation  of  the  external  coordinates,  by  disturbing  a 
state  of  statistical  equilibrium,  may  indirectly  cause  a  diminu- 
tion, (in  a  certain  sense  at  least,)  of  the  value  of  rj. 

But  if  the  change  in  the  external  coordinates  is  very  small, 
the  change  in  the  distribution  necessary  for  equilibrium  will 
in  general  be  correspondingly  small.  Hence,  the  original  dis- 
tribution in  phase,  since  it  differs  little  from  one  which  would 
be  in  statistical  equilibrium  with  the  new  values  of  the  ex- 
ternal coordinates,  may  be  supposed  to  have  a  value  of  v 
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which  differs  by  a  small  quantity  of  the  second  order  from 
the  minimum  value  which  characterizes  the  state  of  statistical 
equilibrium.  And  the  diminution  in  the  average  index  result- 
ing in  the  course  of  time  from  the  very  small  change  in  the 
external  coordinates,  cannot  exceed  this  small  quantity  of 
the  second  order. 

Hence  also,  if  the  change  in  the  external  coordinates  of  an 
ensemble  initially  in  statistical  equilibrium  consists  in  suc- 
cessive very  small  changes  separated  by  very  long  intervals  of 
time  in  which  the  disturbance  of  statistical  equilibrium  be- 
comes sensibly  effaced,  the  final  diminution  in  the  average 
index  of  probability  will  in  general  be  negligible,  although  the 
total  change  in  the  external  coordinates  is  large.  The  result 
will  be  the  same  if  the  change  in  the  external  coordinates 
takes  place  continuously  but  sufficiently  slowly. 

Even  in  cases  in  which  there  is  no  tendency  toward  the 
restoration  of  statistical  equilibrium  in  the  lapse  of  time,  a  varia- 
tion of  external  coordinates  which  would  cause,  if  it  took 
place  in  a  short  time,  a  great  disturbance  of  a  previous  state 
of  equilibrium,  may,  if  sufficiently  distributed  in  time,  produce 
no  sensible  disturbance  of  the  statistical  equilibrium. 

Thus,  in  the  case  of  three  degrees  of  freedom,  let  the  systems 
be  heavy  points  suspended  by  elastic  massless  cords,  and  let  the 
ensemble  be  distributed  in  phase  with  a  density  proportioned 
to  some  function  of  the  energy,  and  therefore  in  statistical  equi- 
librium. For  a  change  in  the  external  coordinates,  we  may 
take  a  horizontal  motion  of  the  point  of  suspension.  If  this 
is  moved  a  given  distance,  the  resulting  disturbance  of  the 
statistical  equilibrium  may  evidently  be  diminished  indefi- 
nitely by  diminishing  the  velocity  of  the  point  of  suspension. 
This  will  be  true  if  the  law  of  elasticity  of  the  string  is  such 
that  the  period  of  vibration  is  independent  of  the  energy,  in 
which  case  there  is  no  tendency  in  the  course  of  time  toward 
a  state  of  statistical  equilibrium,  as  well  as  in  the  more  general 
case,  in  which  there  is  a  tendency  toward  statistical  equilibrium. 

That  something  of  this  kind  will  be  true  in  general,  the 
following  considerations  will  tend  to  show. 
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We  define  a  path  as  the  series  of  phases  through  which  a 
system  passes  in  the  course  of  time  when  the  external  co- 
ordinates have  fixed  values.  When  the  external  coordinates 
are  varied,  paths  are  changed.  The  path  of  a  phase  is  the 
path  to  which  that  phase  belongs.  With  reference  to  any 
ensemble  of  systems  we  shall  denote  by  U\p  the  average  value 
of  the  density-in-phase  in  a  path.  This  implies  that  we  have 
a  measure  for  comparing  different  portions  of  the  path.  We 
shall  suppose  the  time  required  to  traverse  any  portion  of  a 
path  to  be  its  measure  for  the  purpose  of  determining  this 
average. 

With  this  understanding,  let  us  suppose  that  a  certain  en- 
semble is  in  statistical  equilibrium.  In  every  element  of 
extension-in-phase,  therefore,  the  density-in-phase  D  is  equal 
to  its  path-average  2J]p.  Let  a  sudden  small  change  be  made 
in  the  external  coordinates.  The  statistical  equilibrium  will  be 
disturbed  and  we  shall  no  longer  have  D  =  ~B\P  everywhere. 
This  is  not  because  2>  is  changed,  but  because  ~D\P  is  changed, 
the  paths  being  changed.  It  is  evident  that  if  D  >  U\p  in 
a  part  of  a  path,  we  shall  have  D  <~E\9  in  other  parts  of  the 
same  path. 

Now,  if  we  should  imagine  a  further  change  in  the  external 
coordinates  of  the  same  kind,  we  should  expect  it  to  produce 
an  effect  of  the  same  kind.  But  the  manner  in  which  the 
second  effect  will  be  superposed  on  the  first  will  be  different, 
according  as  it  occurs  immediately  after  the  first  change  or 
after  an  interval  of  time.  If  it  occurs  immediately  after  the 
first  change,  then  in  any  element  of  phase  in  which  the  first 
change  produced  a  positive  value  of  D  -  U\p  the  second  change 
will  add  a  positive  value  to  the  first  positive  value,  and  where 
D  -  2J]p  was  negative,  the  second  change  will  add  a  negative 
value  to  the  first  negative  value. 

But  if  we  wait  a  sufficient  time  before  making  the  second 
change  in  the  external  coordinates,  so  that  systems  have 
passed  from  elements  of  phase  in  which  D  -  TJ\P  was  origi- 
nally positive  to  elements  in  which  it  was  originally  negative,, 
and  vice  ver$a,  (the  systems  carrying  with  them  the  values 
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of  D  -  3?lp0  the  positive  values  of  D  -  27JP  caused  by  the 
second  change  will  be  in  part  superposed  on  negative  values 
due  to  the  first  change,  and  vice  vena. 

The  disturbance  of  statistical  equilibrium,  therefore,  pro- 
duced by  a  given  change  in  the  values  of  the  external  co- 
ordinates may  be  very  much  diminished  by  dividing  the 
change  into  two  parts  separated  by  a  sufficient  interval  of 
time,  and  a  sufficient  interval  of  time  for  this  purpose  is  one 
in  which  the  phases  of  the  individual  systems  are  entirely 
unlike  the  first,  so  that  any  individual  system  is  differently 
affected  by  the  change,  although  the  whole  ensemble  is  af- 
fected in  nearly  the  same  way.  Since  there  is  no  limit  to  the 
diminution  of  the  disturbance  of  equilibrium  by  division  of 
the  change  in  the  external  coordinates,  we  may  suppose  as 
a  general  rule  that  by  diminishing  the  velocity  of  the  changes 
in  the  external  coordinates,  a  given  change  may  be  made  to 
produce  a  very  small  disturbance  of  statistical  equilibrium. 

If  we  write  rf  for  the  value  of  the  average  index  of  probability 
before  the  variation  of  the  external  coordinates,  and  rj"  for  the 
value  after  this  variation,  we  shall  have  in  any  case 

Zf'  <  Zf 
V    =  V 

as  the  simple  result  of  the  variation  of  the  external  coordi- 
nates. This  may  be  compared  with  the  thermodynamic  the- 
orem that  the  entropy  of  a  body  cannot  be  diminished  by 
mechanical  (as  distinguished  from  thermal)  action.* 

If  we  have  (approximate)  statistical  equilibrium  between 
the  times  if  and  if '  (corresponding  to  r{  and  17"),  we  shall  have 
approximately 

v  =  v"> 
which  may  be  compared  with  the  thermodynamic  theorem  that 
the  entropy  of  a  body  is  not  (sensibly)  affected  by  mechanical 
action,  during  which  the  body  is  at  each  instant  (sensibly)  in 
a  state  of  thermodynamic  equilibrium. 

Approximate  statistical  equilibrium  may  usually  be  attained 

*  The  correspondences  to  which  the  reader's  attention  is  called  are  between 
—  if  and  entropy,  and  between  8  and  temperature. 
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by  a  sufficiently  slow  variation  of  the  external  coordinates, 
just  as  approximate  thermodynamic  equilibrium  may  usually 
be  attained  by  sufficient  slowness  in  the  mechanical  operations 
to  which  the  body  is  subject. 

We  now  pass  to  the  consideration  of  the  effect  on  an  en- 
semble of  systems  which  is  produced  by  the  action  of  other 
ensembles  with  which  it  is  brought  into  dynamical  connec- 
tion. In  a  previous  chapter  *  we  have  imagined  a  dynamical 
connection  arbitrarily  created  between  the  systems  of  two 
ensembles.  We  shall  now  regard  the  action  between  the 
systems  of  the  two  ensembles  as  a  result  of  the  variation 
of  the  external  coordinates,  which  causes  such  variations 
of  the  internal  coordinates  as  to  bring  the  systems  of  the 
two  ensembles  within  the  range  of  each  other's  action. 

Initially,  we  suppose  that  we  have  two  separate  ensembles 
of  systems,  Ex  and  Ev  The  numbers  of  degrees  of  freedom 
of  the  systems  in  the  two  ensembles  will  be  denoted  by  14  and 
Ha  respectively,  and  the  probability-coefficients  by  e&  and  e*». 
Now  we  may  regard  any  system  of  the  first  ensemble  com- 
bined with  any  system  of  the  second  as  forming  a  single 
system  of  74  +  w,  degrees  of  freedom.  Let  us  consider  the 
ensemble  (JSfw)  obtained  by  thus  combining  each  system  of  the 
first  ensemble  with  each  of  the  second. 

At  the  initial  moment,  which  may  be  specified  by  a  single 
accent,  the  probability-coefficient  of  any  phase  of  the  combined 
systems  is  evidently  the  product  of  the  probability-coefficients 
of  the  phases  of  which  it  is  made  up.  This  may  be  expressed 
by  the  equation, 

^'  =  <j*'e*',  (456) 

or  W  =  %'  +  *'t  («6) 

which  gives  rut  =  vi  +  *)*•  (467) 

The  forces  tending  to  vary  the  internal  coordinates  of  the 
combined  systems,  together  with  those  exerted  by  either 
system  upon  the  bodies  represented  by  the  coordinates  called 

•  See  Chapter  IV,  page  87. 
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external,  may  be  derived  from  a  single  force-function,  which, 
taken  negatively,  we  shall  call  the  potential  energy  of  the 
combined  systems  and  denote  by  elr  Bnt  we  suppose  that 
initially  none  of  the  systems  of  the  two  ensembles  Ex  and 
Et  come  within  range  of  each  other's  action,  so  that  the 
potential  energy  of  the  combined  system  falls  into  two  parts 
relating  separately  to  the  systems  which  are  combined.  The 
same  is  obviously  true  of  the  kinetic  energy  of  the  combined 
compound  system,  and  therefore  of  its  total  energy.  This 
may  be  expressed  by  the  equation 

€*'=€/  +  €,',  (468) 

which  gives  V  =  W  +  €.'.  (459) 

Let  us  now  suppose  that  in  the  course  of  time,  owing  to  the 
motion  of  the  bodies  represented  by  the  coordinates  called 
external,  the  forces  acting  on  the  systems  and  consequently 
their  positions  are  so  altered,  that  the  systems  of  the  ensembles 
Ex  and  JS%  are  brought  within  range  of  each  other's  action, 
and  after  such  mutual  influence  has  lasted  for  a  time,  by  a 
further  change  in  the  external  coordinates,  perhaps  a  return 
to  their  original  values,  the  systems  of  the  two  original  en- 
sembles are  brought  again  out  of  range  of  each  other's  action. 
Finally,  then,  at  a  time  specified  by  double  accents,  we  shall 
have  as  at  first 

«i."  =  *"  +  i«".  (460) 

But  for  the  indices  of  probability  we  must  write  * 

V'  +  W'^W-  (461) 

The  considerations  adduced  in  the  last  chapter  show  that  it 
is  safe  to  write 

W^W-  (462) 

We  have  therefore 

*"  +  *"<Vi'  +  Tt',  (463) 

which  may  be  compared  with  the  thermodynamic  theorem  that 
*  Bee  Chapter  XI,  Theorem  VIL 
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the  thermal  contact  of  two  bodies  may  increase  but  cannot 
diminish  the  sum  of  their  entropies. 

Let  ue  especially  consider  the  case  in  which  the  two  original 
ensembles  were  both  canonically  distributed  in  phase  with  the 
respective  moduli  &t  and  ©r  We  have  then,  by  Theorem  III 
of  Chapter  XI, 

*'  +  %£*"+%  (464) 

V  +  |[SV  +  ^  (466) 

Whence  with  (468)  we  have 

7.H  —  7,1      7.11  —  7j 

Hr-+J-*r-*0'  (467) 

If  we  write  W  for  the  average  work  done  by  the  combined 
systems  on  the  external  bodies,  we  have  by  the  principle  of 
the  conservation  of  energy 

W=  €ls'  -  elf"  =  *'  -  *"  +  i,'  -  *".  (468) 

Now  if  IP  is  negligible,  we  have 

7// _i/ =  -(#/-#)  («9) 

and  (467)  shows  that  the  ensemble  which  has  the.,  greater 
modulus  must  lose  energy.  This  result  may  be  compared  to 
the  thermodynamic  principle,  that  when  two  bodies  of  differ- 
ent temperatures  are  brought  together,  that  which  has  the 
higher  temperature  will  lose  energy. 

Let  us  next  suppose  that  the  ensemble  E%  is  originally 
canonically  distributed  with  the  modulus  0,,  but  leave  the 
distribution  of  the  other  arbitrary.  We  have,  to  determine 
the  result  of  a  similar  process, 
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Hence                         *"  +  |j^*'  +  |£ 

(470) 

which  may  be  written 

(471) 

Tliis  may  be  compared  with  the  thermodynamic  principle  that 
when  a  body  (which  need  not  be  in  thermal  equilibrium)  is 
brought  into  thermal  contact  with  another  of  a  given  tempera- 
ture, the  increase  of  entropy  of  the  first  cannot  be  less  (alge- 
braically) than  the  loss_of  heat  by  the  second  divided  by  its 
temperature.    Where  W  is  negligible,  we  may  write 

*"  +  |£<*'  +  !  (472) 

Now,  by  Theorem  III  of  Chapter  XI,  the  quantity 

%  +  §  (473) 

has  a  minimum  value  when  the  ensemble  to  which  ^  and  ex 
relate  is  distributed  canonically  with  the  modulus  %r  If  the 
ensemble  had  originally  this  distribution,  the  sign  <  in  (472) 
would  be  impossible.  In  fact,  in  this  case,  it  would  be  easy  to 
show  that  the  preceding  formulae  on  which  (472)  is  founded 
would  all  have  the  sign  =  .  But  when  the  two  ensembles  are 
not  both  originally  distributed  canonically  with  the  same 
modulus,  the  formulae  indicate  that  the  quantity  (478)  may 
be  diminished  by  bringing  the  ensemble  to  which  ex  and  rjx 
relate  into  connection  with  another  which  is  canonically  dis- 
tributed with  modulus  ©2,  and  therefore,  that  by  repeated 
operations  of  this  kind  the  ensemble  of  which  the  original  dis- 
tribution was  entirely  arbitrary  might  be  brought  approxi- 
mately into  a  state  of  canonical  distribution  with  the  modulus 
©,.  We  may  compare  this  with  the  thermodynamic  principle 
that  a  body  of  which  the  original  thermal  state  may  be  entirely 
arbitrary,  may  be  brought  approximately  into  a  state  of  ther- 
mal equilibrium  with  any  given  temperature  by  repeated  con- 
nections with  other  bodies  of  that  temperature. 
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Let  us  now  suppose  that  we  have  a  certain  number  of 
ensembles,  JE?0,  Hl9 E%9  etc.,  distributed  canonically  with  the 
respective  moduli  O0 ,  %t ,  0, ,  etc.  By  variation  of  the  exter- 
nal coordinates  of  the  ensemble  H0 ,  let  it  be  brought  into 
connection  with  Bx ,  and  then  let  the  connection  be  broken. 
Let  it  then  be  brought  into  connection  with  E% ,  and  then  let 
that  connection  be  broken.  Let  this  process  be  continued 
with  respect  to  the  remaining  ensembles.  We  do  not  make 
the  assumption,  as  in  some  cases  before,  that  the  work  connected 
with  the  variation  of  the  external  coordinates  is  a  negligible 
quantity.  On  the  contrary,  we  wish  especially  to  consider 
the  case  in  which  it  is  large.  In  the  final  state  of  the  ensem- 
ble E0 ,  let  us  suppose  that  the  external  coordinates  have  been 
brought  back  to  their  original  values,  and  that  the  average 
energy  (e0)  is  the  same  as  at  first. 

In  our  usual  notations,  using  one  and  two  accents  to  dis- 
tinguish original  and  final  values,  we  get  by  repeated  applica- 
tions of  the  principle  expressed  in  (468) 

V  +  ni  +  V  +  etc.  >  ^0"  +  vi"  +  vt"  +  etc.        (474) 
But  by  Theorem  III  of  Chapter  XI, 

*"  +  l^*'  +  |'>  (476) 

*"  +  |J>*'  +  |£  (477) 

etc. 

Hence       |!!  +  ^  +  ^  + etc.  >  ^ +  ^  + |  + etc.        (478) 

©0         ©l         ©«  ~  ©o        ©i        ®i  x        ' 

or,  since  i</  =  io", 

Osi^  +  ^  +  e*.  (479) 

If  we  write  IP  for  the  average  work  done  on  the  bodies  repre- 
sented by  the  external  coordinates,  we  have 
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5/  _  exn  +  ^  _  £//  +  etc  =  W.  (480) 

If  J?09  JBV  and  J^  are  the  only  ensembles,  we  have 

^^^-g^  (*'-*")•  (481) 

It  will  be  observed  that  the  relations  expressed  in  the  last 
three  formulae  between  W,  \  —  6j",  ej'  —  ej",  etc,,  and  01, 
Bv  etc.  are  precisely  those  which  hold  in  a  Caraot's  cycle  for 
the  work  obtained,  the  energy  lost  by  the  several  bodies  which 
serve  as  heaters  or  coolers,  and  their  initial  temperatures. 

It  will  not  escape  the  reader's  notice,  that  while  from  one 
point  of  view  the  operations  which  are  here  described  are  quite 
beyond  our  powers  of  actual  performance,  on  account  of  the 
impossibility  of  handling  the  immense  number  of  systems 
which  are  involved,  yet  from  another  point  of  view  the  opera- 
tions described  are  the  most  simple  and  accurate  means  of 
representing  what  actually  takes  place  in  our  simplest  experi- 
ments in  thermodynamics.  The  states  of  the  bodies  which 
we  handle  are  certainly  not  known  to  us  exactly.  What  we 
know  about  a  body  can  generally  be  described  most  accurately 
and  most  simply  by  saying  that  it  is  one  taken  at  random 
from  a  great  number  (ensemble)  of  bodies  which  are  com- 
pletely described.  If  we  bring  it  into  connection  with  another 
body  concerning  which  we  have  a  similar  limited  knowledge, 
the  state  of  the  two  bodies  is  properly  described  as  that  of  a 
pair  of  bodies  taken  from  a  great  number  (ensemble)  of  pairs 
which  are  formed  by  combining  each  body  of  the  first  en- 
semble with  each  of  the  second. 

Again,  when  we  bring  one  body  into  thermal  contact  with 
another,  for  example,  in  a  Carnot's  cycle,  when  we  bring  a 
mass  of  fluid  into  thermal  contact  with  some  other  body  from 
which  we  wish  it  to  receive  heat,  we  may  do  it  by  moving  the 
vessel  containing  the  fluid.  This  motion  is  mathematically 
expressed  by  the  variation  of  the  coordinates  which  determine 
the  position  of  the  vessel.  We  allow  ourselves  for  the  pur- 
poses of  a  theoretical  discussion  to  suppose  that  the  walls  of 
this  vessel  are  incapable  of  absorbing  heat  from  the  fluid. 
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Yet  while  we  exclude  the  kind  of  action  which  we  call  ther- 
mal between  the  fluid  and  the  containing  vessel,  we  allow  the 
kind  which  we  call  work  in  the  narrower  sense,  which  takes 
place  when  the  volume  of  the  fluid  is  changed  by  the  motion 
of  a  piston.  This  agrees  with  what  we  have  supposed  in 
regard  to  the  external  coordinates,  which  we  may  vary  in 
any  arbitrary  manner,  and  are  in  this  entirely  unlike  the  co- 
ordinates of  the  second  ensemble  with  which  we  bring  the 
first  into  connection. 

When  heat  passes  in  any  thermodynamic  experiment  between 
the  fluid  principally  considered  and  some  other  body,  it  is 
actually  absorbed  and  given  out  by  the  walls  of  the  vessel, 
which  will  retain  a  varying  quantity.  This  is,  however,  a 
disturbing  circumstance,  which  we  suppose  in  some  way  made 
negligible,  and  actually  neglect  in  a  theoretical  discussion. 
In  our  case,  we  suppose  the  walls  incapable  of  absorbing  en- 
ergy, except  through  the  motion  of  the  external  coordinates, 
but  that  they  allow  the  systems  which  they  contain  to  act 
directly  on  one  another.  Properties  of  this  kind  are  mathe- 
matically expressed  by  supposing  that  in  the  vicinity  of  a 
certain  surface,  the  position  of  which  is  determined  by  certain 
(external)  coordinates,  particles  belonging  to  the  system  in 
question  exjjerience  a  repulsion  from  the  surface  increasing  so 
rapidly  with  nearness  to  the  surface  that  an  infinite  expendi- 
ture of  energy  would  be  required  to  carry  them  through  it. 
It  is  evident  that  two  systems  might  be  separated  by  a  surface 
or  surfaces  exerting  the  proper  forces,  and  yet  approach  each 
other  closely  enough  to  exert  mechanical  action  on  each  other. 
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CHAPTER  XIV. 

DISCUSSION  OF  THERMODYNAMIC  ANALOGIES. 

If  we  wish  to  find  in  rational  mechanics  an  a  priori  founda- 
tion for  the  principles  of  thermodynamics*  we  must  seek 
mechanical  definitions  of  temperature  and  entropy.  The 
quantities  thus  defined  must  satisfy  (under  conditions  and 
with  limitations  which  again  must  be  specified  in  the  language 
of  mechanics)  the  differential  equation 

de  =  Tdtf  -  Aldal  -  A* <fa,  -  etc^  (482) 

where  c,  T,  and  q  denote  the  energy,  temperature,  and  entropy 
of  the  system  considered,  and  Axdav  etc.,  the  mechanical  work 
(in  the  narrower  sense  in  which  the  term  is  used  in  thermo- 
dynamics, t.  £.,  with  exclusion  of  thermal  action)  done  upon 
external  bodies. 

This  implies  that  we  are  able  to  distinguish  in  mechanical 
terms  the  thermal  action  of  one  system  on  another  from  that 
which  we  call  mechanical  in  the  narrower  sense,  if  not  indeed 
in  every  case  in  which  the  two  may  be  combined,  at  least  so  as 
to  specify  cases  of  thermal  action  and  cases  of  mechanical 
action. 

Such  a  differential  equation  moreover  implies  a  finite  equa- 
tion between  e,  17,  and  av  ar  etc.,  which  may  be  regarded 
as  fundamental  in  regard  to  those  properties  of  the  system 
which  we  call  thermodynamic,  or  which  may  be  called  so  from 
analogy.  This  fundamental  thermodynamic  equation  is  de- 
termined by  the  fundamental  mechanical  equation  which 
expresses  the  energy  of  the  system  as  function  of  its  mo- 
menta and  coordinates  with  those  external  coordinates  (av  av 
etc.)  which  appear  in  the  differential  expression  of  the  work 
done  on  external  bodies.  We  have  to  show  the  mathematical 
operations  by  which  the  fundamental  thermodynamic  equation, 
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which  in  general  is  an  equation  of  few  variables,  is  derived 
from  the  fundamental  mechanical  equation,  which  in  the  case 
of  the  bodies  of  nature  is  one  of  an  enormous  number  of 
variables. 

We  have  also  to  enunciate  in  mechanical  terms,  and  to 
prove,  what  we  call  the  tendency  of  heat  to  pass  from  a  sys- 
tem of  higher  temperature  to  one  of  lower,  and  to  show  that 
this  tendency  vanishes  with  respect  to  systems  of  the  same 
temperature. 

At  least,  we  have  to  show  by  a  priori  reasoning  that  for 
such  systems  as  the  material  bodies  which  nature  presents  to 
us,  these  relations  hold  with  such  approximation  that  they 
are  sensibly  true  for  human  faculties  of  observation.  This 
indeed  is  all  that  is  really  necessary  to  establish  the  science  of 
thermodynamics  on  an  a  priori  basis.  Tet  we  will  naturally 
desire  to  find  the  exact  expression  of  those  principles  of  which 
the  laws  of  thermodynamics  are  the  approximate  expression. 
A  very  little  study  of  the  statistical  properties  of  conservative 
systems  of  a  finite  number  of  degrees  of  freedom  is  sufficient 
to  make  it  appear,  more  or  less  distinctly,  that  the  general 
laws  of  thermodynamics  are  the  limit  toward  which  the  exact 
laws  of  such  systems  approximate,  when  their  number  of 
degrees  of  freedom  is  indefinitely  increased.  And  the  problem 
of  finding  the  exact  relations,  as  distinguished  from  the  ap- 
proximate, for  systems  of  a  great  number  of  degrees  of  free- 
dom, is  practically  the  same  as  that  of  finding  the  relations 
which  hold  for  any  number  of  degrees  of  freedom,  as  distin- 
guished from  those  which  have  been  established  on  an  em- 
pirical basis  for  systems  of  a  great  number  of  degrees  of 
freedom. 

The  enunciation  and  proof  of  these  exact  laws,  for  systems 
of  any  finite  number  of  degrees  of  freedom,  has  been  a  princi- 
pal object  of  the  preceding  discussion.  But  it  should  be  dis- 
tinctly stated  that,  if  the  results  obtained  when  the  numbers 
of  degrees  of  freedom  are  enormous  coincide  sensibly  with- 
the  general  laws  of  thermodynamics,  however  interesting  and 
significant  this  coincidence  may  be,  we  are  still  far  from 
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having  explained  the  phenomena  of  nature  with  respect  to 
these  laws.  For,  as  compared  with  the  case  of  nature,  the 
systems  which  we  have  considered  are  of  an  ideal  simplicity. 
Although  our  only  assumption  is  that  we  are  considering 
conservative  systems  of  a  finite  number  of  degrees  of  freedom, 
it  would  seem  that  this  is  assuming  far  too  much,  so  far  as  the 
bodies  of  nature  are  concerned.  The  phenomena  of  radiant 
heat,  which  certainly  should  not  be  neglected  in  any  complete 
system  of  thermodynamics,  and  the  electrical  phenomena 
associated  with  the  combination  of  atoms,  seem  to  show  that 
the  hypothesis  of  systems  of  a  finite  number  of  degrees  of 
freedom  is  inadequate  for  the  explanation  of  the  properties  of 
bodies. 

Nor  do  the  results  of  such  assumptions  in  every  detail 
appear  to  agree  with  experience.  We  should  expect,  for 
example,  that  a  diatomic  gas,  so  far  as  it  could  be  treated 
independently  of  the  phenomena  of  radiation,  or  of  any  sort  of 
electrical  manifestations,  would  have  six  degrees  of  freedom 
for  each  molecule.  But  the  behavior  of  such  a  gas  seems  to 
indicate  not  more  than  five. 

But  although  these  difficulties,  long  recognized  by  physi- 
cists,* seem  to  prevent,  in  the  present  state  of  science,  any 
satisfactory  explanation  of  the  phenomena  of  thermodynamics 
as  presented  to  us  in  nature,  the  ideal  case  of  systems  of  a 
finite  number  of  degrees  of  freedom  remains  as  a  subject 
which  is  certainly  not  devoid  of  a  theoretical  interest,  and 
which  may  serve  to  point  the  way  to  the  solution  of  the  far 
more  difficult  problems  presented  to  us  by  nature.  And  if 
the  study  of  the  statistical  properties  of  such  systems  gives 
us  an  exact  expression  of  laws  which  in  the  limiting  case  take 
the  form  of  the  received  laws  of  thermodynamics,  its  interest 
is  so  much  the  greater. 

Now  we  have  defined  what  we  have  called  the  modulus  (©) 
of  an  ensemble  of  systems  canonically  distributed  in  phase, 
and  what  we  have  called  the  index  of  probability  (17)  of  any 
phase  in  such  an  ensemble.    It  has  been  shown  that  between 

•  See  Boltzmann,  Sltib.  der  Wiener  Akad.,  Bd.  LXIIL,  S.  418,  (1871). 
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the  modulus  (©),  the  external  coordinates  (ax ,  etc.),  and  the 
average  values  in  the  ensemble  of  the  energy  (e),  the  index 
of  probability  (17),  and  the  Bxternal  forces  (A19  etc.)  exerted 
by  the  systems,  the  following  differential  equation  will  hold : 

de  =  —  %&i  —  Axdax  —  A%da%  —  etc  (483) 

This  equation,  if  we  neglect  the  sign  of  averages,  is  identical 
in  form  with  the  thermodynamic  equation  (482),  the  modulus 
(0)  corresponding  to  temperature,  and  the  index  of  probabil- 
ity of  phase  with  its  sign  reversed  corresponding  to  entropy.4 

We  have  also  shown  that  the  average  square  of  the  anoma- 
lies of  e,  that  is,  of  the  deviations  of  the  individual  values  from 
the  average,  is  in  general  of  the  same  order  of  magnitude  as 
the  reciprocal  of  the  number  of  degrees  of  freedom,  and  there- 
fore to  human  observation  the  individual  values  are  indistin- 
guishable from  the  average  values  when  the  number  of  degrees 
of  freedom  is  very  great,  f  In  this  case  also  the  anomalies  of  17 
are  practically  insensible.  The  same  is  true  of  the  anomalies  of 
the  external  forces  (Ax ,  etc.),  so  far  as  these  are  the  result  of 
the  anomalies  of  energy,  so  that  when  these  forces  are  sensibly 
determined  by  the  energy  and  the  external  coordinates,  and 
the  number  of  degrees  of  freedom  is  very  great,  the  anomalies 
of  these  forces  are  insensible. 

The  mathematical  operations  by  which  the  finite  equation 
between  c,  77,  and  a^ ,  etc.,  is  deduced  from  that  which  gives 
the  energy  (e)  of  a  system  in  terms  of  the  momenta  (px . . .  .p^) 
and  coordinates  both  internal  (qt . . .  j„)  and  external  (a^ ,  etc.), 
are  indicated  by  the  equation 

f  all  C 

e    e=/...l6   *dqx  ..  .dqndpx  ...  dpnf  (484) 


where  ^  =  ®rj  +  e. 

We  have  also  shown  that  when  systems  of  different  ensem- 
bles are  brought  into  conditions  analogous  to  thermal  contact, 
the  average  result  is  a  passage  of  energy  from  the  ensemble 

•  See  Chapter  IV,  pages  44,  46.  t  See  Chapter  VII,  pages  73-76. 
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of  the  greater  modulus  to  that  of  the  less,  *  or  in  case  of  equal 
moduli,  that  we  have  a  condition  of  statistical  equilibrium  in 
regard  to  the  distribution  of  energy,  f 

Propositions  have  also  been  demonstrated  analogous  to 
those  in  thermodynamics  relating  to  a  Carnot's  cyole,$  or  to 
the  tendency  of  entropy  to  increase,  §  especially  when  bodies 
of  different  temperature  are  brought  into  contact.  | 

We  have  thus  precisely  defined  quantities,  and  rigorously 
demonstrated  propositions,  which  hold  for  any  number  of 
degrees  of  freedom,  and  which,  when  the  number  of  degrees 
of  freedom  (n)  is  enormously  great,  would  appear  to  human 
faculties  as  the  quantities  and  propositions  of  empirical  ther- 
modynamics. 

It  is  evident,  however,  that  there  may  be  more  than  one 
quantity  defined  for  finite  values  of  n,  which  approach  the 
same  limit,  when  n  is  increased  indefinitely,  and  more  than  one 
proposition  relating  to  finite  values  of  n,  which  approach  the 
same  limiting  form  for  n  =  oo.  There  may  be  therefore, 
and  there  are,  other  quantities  which  may  be  thought  to  have 
some  claim  to  be  regarded  as  temperature  and  entropy  with 
respect  to  systems  of  a  finite  number  of  degrees  of  freedom. 

The  definitions  and  propositions  which  we  have  been  con- 
sidering relate  essentially  to  what  we  have  called  a  canonical 
ensemble  of  systems.  This  may  appear  a  less  natural  and 
simple  conception  than  what  we  have  called  a  microcanonical 
ensemble  of  systems,  in  which  all  have  the  same  energy,  and 
which  in  many  cases  represents  simply  the  time~en*errible%  or 
ensemble  of  phases  through  which  a  single  system  passes  in 
the  course  of  time. 

It  may  therefore  seem  desirable  to  find  definitions  and 
propositions  relating  to  these  microcanonical  ensembles,  which 
shall  correspond  to  what  in  thermodynamics  are  based  on 
experience.    Now  the  differential  equation 

d€  =  e^rrflogr-3^  <fax  -  2^  do,  -  etc.,       (485) 

•  See  Chapter  Xm,  page  160.  t  See  Chapter  IV,  pages  86-87. 

X  See  Chapter  XIII,  pages  162, 168.       $  See  Chapter  XII,  pages  148-161. 
H  See  Chapter  XIII,  page  169. 
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which  has  been  demonstrated  in  Chapter  X,  and  which  relates  to 
a  microcanonical  ensemble,  "3^  denoting  the  average  value  of 
At  in  such  an  ensemble,  corresponds  precisely  to  the  thermody- 
namic equation,  except  for  the  sign  of  average  applied  to  the 
external  forces.  But  as  these  forces  are  not  entirely  deter- 
mined by  the  energy  with  the  external  coordinates,  the  use  of 
average  values  is  entirely  germane  to  the  subject,  and  affords 
the  readiest  means  of  getting  perfectly  determined  quantities. 
These  averages,  which  are  taken  for  a  microcanonical  ensemble, 
may  seem  from  some  points  of  view  a  more  simple  and  natural 
conception  than  those  which  relate  to  a  canonical  ensemble. 
Moreover,  the  energy,  and  the  quantity  corresponding  to  en- 
tropy, are  free  from  the  sign  of  average  in  this  equation. 

The  quantity  in  the  equation  which  corresponds  to  entropy 
is  log  V,  the  quantity  V  being  defined  as  the  extension-in- 
phase  within  which  the  energy  is  less  than  a  certain  limiting 
value  (e).  This  is  certainly  a  more  simple  conception  than  the 
average  value  in  a  canonical  ensemble  of  the  index  of  probabil- 
ity of  phase.    Log  V  has  the  property  that  when  it  is  constant 

de  =  -2^  <fai  -  3^t  d<h  +  etc.,  (486) 

which  closely  corresponds  to  the  thermodynamic  property  of 
entropy,  that  when  it  is  constant 

de  =  -  A1  dc^  -  At  da*  +  etc.  (487) 

The  quantity  in  the  equation  which  corresponds  to  tem- 
perature is  «"*  Vy  or  dejd  log  V.  In  a  canonical  ensemble,  the 
average  value  of  this  quantity  is  equal  to  the  modulus,  as  has 
been  shown  by  different  methods  in  Chapters  IX  and  X. 

In  Chapter  X  it  has  also  been  shown  that  if  the  systems 
of  a  microcanonical  ensemble  consist  of  parts  with  separate 
energies,  the  average  value  of  e-*  Vi or  any  part  is  equal  to  its 
average  value  for  any  other  part,  and  to  the  uniform  value 
of  the  same  expression  for  the  whole  ensemble.  This  corre- 
sponds to  the  theorem  in  the  theory  of  heat  that  in  case  of 
thermal  equilibrium  the  temperatures  of  the  parts  of  a  body 
are  equal  to  one  another  and  to  that  of  the  whole  body. 
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Since  the  energies  of  the  parts  of  a  body  cannot  be  supposed 
to  remain  absolutely  constant,  even  where  this  is  the  case 
with  respect  to  the  whole  body,  it  is  evident  that  if  we  regard 
the  temperature  as  a  function  of  the  energy,  the  taking  of 
average  or  of  probable  values,  or  some  other  statistical  process, 
must  be  used  with  reference  to  the  parts,  in  order  to  get  a 
perfectly  definite  value  corresponding  to  the  notion  of  tem- 
perature. 

It  is  worthy  of  notice  in  this  connection  that  the  average 
value  of  the  kinetic  energy,  either  in  a  microcanonical  en- 
semble, or  in  a  canonical,  divided  by  one  half  the  number  of 
degrees  of  freedom,  is  equal  to  r~+  V>  or  to  its  average  value, 
and  that  this  is  true  not  only  of  the  whole  system  which  is 
distributed  either  microcanonically  or  canonically,  but  also 
of  any  part,  although  the  corresponding  theorem  relating  to 
temperature  hardly  belongs  to  empirical  thermodynamics,  since 
neither  the  (inner)  kinetic  energy  of  a  body,  nor  its  number 
of  degrees  of  freedom  is  immediately  cognizable  to  our  facul- 
ties, and  we  meet  the  gravest  difficulties  when  we  endeavor 
to  apply  the  theorem  to  the  theory  of  gases,  except  in  the 
simplest  case,  that  of  the  gases  known  as  monatomic. 

But  the  correspondence  between  r^For  dejd log  V  and 
temperature  is  imperfect.  If  two  isolated  systems  have  such 
energies  that 

dex      _      <fe, 
rflog  V1~dlog  V%* 

and  the  two  systems  are  regarded  as  combined  to  form  a  third 
system  with  energy 

Cis  =  *i  +  *i, 

we  shall  not  have  in  general 

rflog  ru""rflog  Pi~rflog  V* 

as  analogy  with  temperature  would  require.  In  fact,  we  have 
seen  that 


<*logFlt  =  <*logFi|ru  =  dlog  Fj.u' 
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where  the  second  and  third  members  of  the  equation  denote 
average  values  in  an  ensemble  in  which  the  compound  system 
is  microcanonically  distributed  in  phase.  Let  us  suppose  the 
two  original  systems  to  be  identical  in  nature.    Then 

€j  =  €%  =  ^ta  =  cHf^ . 
The  equation  in  question  would  require  that 


dex  3eT 


dlog  V^dlog  FiL* 


i.  e.,  that  we  get  the  same  result,  whether  we  take  the  value 
of  dejdlog  V\  determined  for  the  average  value  of  ex  in  the 
ensemble,  or  take  the  average  value  of  dejdlog  Vv  This 
will  be  the  case  where  dejdlog  Vx  is  a  linear  function  of  er 
Evidently  this  does  not  constitute  the  most  general  case. 
Therefore  the  equation  in  question  cannot  be  true  in  general. 
It  is  true,  however,  in  some  very  important  particular  cases,  as 
when  the  energy  is  a  quadratic  function  of  the  p'&  and  q%  or 
of  the  ^'s  alone.*  When  the  equation  holds,  the  case  is  anal- 
ogous to  that  of  bodies  in  thermodynamics  for  which  the 
specific  heat  for  constant  volume  is  constant. 

Another  quantity  which  is  closely  related  to  temperature  is 
d<f>/d€.  It  has  been  shown  in  Chapter  IX  that  in  a  canonical 
ensemble,  if  n  >  2,  the  average  value  of  d<f>/de  is  1/©,  and 
that  the  most  common  value  of  the  energy  in  the  ensemble  is 
that  for  which  d<f>/de  =  1/8.  The  first  of  these  properties 
may  be  compared  with  that  of  de/d  log  V>  which  has  been 
seen  to  have  the  average  value  0  in  a  canonical  ensemble, 
without  restriction  in  regard  to  the  number  of  degrees  of 
freedom. 

With  respect  to  microcanonical  ensembles  also,  d<f>/de  has 
a  property  similar  to  what  has  been  mentioned  with  respect  to 
de/d  log  V.  That  is,  if  a  system  microcanonically  distributed 
in  phase  consists  of  two  parts  with  separate  energies,  and  each 

*  This  last  case  is  important  on  account  of  its  relation  to  the  theory  of 
gases,  although  it  must  in  strictness  be  regarded  as  a  limit  of  possible  cases, 
rather  than  as  a  case  which  is  itself  possible. 
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with  more  than  two  degrees  of  freedom,  the  average  values  in 
the  ensemble  of  d<f>/cU  for  the  two  parts  are  equal  to  one 
another  and  to  the  value  of  same  expression  for  the  whole. 
In  our  usual  notations 

if  14  >  2,  and  n2>  2. 

This  analogy  with  temperature  has  the  same  incompleteness 
which  was  noticed  with  respect  to  de/dlog  V,  viz.,  if  two  sys- 
tems have  such  energies  (ex  and  es)  that 

and  they  are  combined  to  form  a  third  system  with  energy 

we  shall  not  have  in  general 

deu       det       det ' 

Thus,  if  the  energy  is  a  quadratic  function  of  the  p's  and  q% 

we  have  * 

<fyi  __  fH  —  1  d<f>t  _  n*  —  1 

^T~~      *i    '  *s~"~      €t     ' 

dfri  =  »u  —  1  _  fh  +  n,  —  1 

*li  *1S  «l  +  «S       * 

where  *h ,  n,,  nlt,  are  the  numbers  of  degrees  of  freedom  of  the 
separate  and  combined  systems.    But 

d <h  _  d  fa  _  ih  ±  nt  —  2 
d*i       d€%  €x  +  €t 

If  the  energy  is  a  quadratic  function  of  the  pys  alone,  the  case 
would  be  the  same  except  that  we  should  have  \nl%\n%y\nl%^ 
instead  of  14 ,  n,,  nlr    In  these  particular  cases,  the  analogy 

*  See  foot-note  on  page  93.  We  have  here  made  the  least  value  of  the 
energy  consistent  with  the  values  of  the  external  coordinates  sero  instead 
of  fa,  as  is  evidently  allowable  when  the  external  coordinates  are  supposed 
invariable. 
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between  cU/d  log  V  and  temperature  would  be  complete,  as  has 
already  been  remarked.    We  should  have 

dlogVt^nt9  rflog^1^J^, 

dlogVi*  "  ih,  "  rflog  Fi  "  rf  log  Vt 

when  the  energy  is  a  quadratic  function  of  the  p'a  and  q%  and 
similar  equations  with  J?^ ,  J  w^ ,  J  ^ ,  instead  of  nj ,  14 ,  nw , 
when  the  eneigy  is  a  quadratic  function  of  the  p9s  alone. 

More  characteristic  of  d<f>/de  are  its  properties  relating  to 
most  probable  values  of  energy.  If  a  system  having  two  parts 
with  separate  energies  and.  each  with  more  than  two  degrees 
of  freedom  is  microcanonically  distributed  in  phase,  the  most 
probable  division  of  energy  between  the  parts,  in  a  system 
taken  at  random  from  the  ensemble,  satisfies  the  equation 

A""^"'  (488> 

which  corresponds  to  the  thermodynamic  theorem  that  the 
distribution  of  energy  between  the  parts  of  a  system,  in  case  of 
thermal  equilibrium,  is  such  that  the  temperatures  of  the  parts 
are  equal. 

To  prove  the  theorem,  we  observe  that  the  fractional  part 
of  the  whole  number  of  systems  which  have  the  energy  of  one 
part  (ex)  between  the  limits  ex'  and  ex"  is  expressed  by 


•a" 

6  r 


delf 


where  the  variables  are  connected  by  the  equation 
ex  +  e,  =  constant  =  els. 

The  greatest  value  of  this  expression,  for  a  constant  infinitesi- 
mal value  of  the  difference  €l"  —  e/,  determines  a  value  of  et , 
which  we  may  call  its  most  probable  value.  This  depends  on 
the  greatest  possible  value  of  fa  +  fa.  Now  if  n^  >  2,  and 
14  >  2,  we  shall  have  fa  =  —  00  for  the  least  possible  value  of 
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€|,  and  <f>2  =  —  ao  for  the  least  possible  value  of  e,.  Between 
these  limits  <f>t  and  £,  will  be  finite  and  continuous.  Hence 
<f>x  +  <f>2  will  have  a  maximum  satisfying  the  equation  (488). 

But  if  nj  <  2,  or  14  <  2,  d<f^l/cUl  or  cUf^jd^  may  be  nega- 
tive, or  zero,  for  all  values  of  ex  or  er  and  can  hardly  be 
regarded  as  having  properties  analogous  to  temperature. 

It  is  also  worthy  of  notice  that  if  a  system  which  is  micro- 
canonically  distributed  in  phase  has  three  parts  with  separate 
energies,  and  each  with  more  than  two  degrees  of  freedom,  the 
most  probable  division  of  energy  between  these  parts  satisfies 
the  equation 

*i  *i  <h* ' 
That  is,  this  equation  gives  the  most  probable  set  of  values 
of  ely  €,,  and  63.  But  it  does  not  give  the  most  probable 
value  of  e19  or  of  ea,  or  of  er  Thus,  if  the  energies  are  quad- 
ratic functions  of  the  p's  and  q%  the  most  probable  division 
of  energy  is  given  by  the  equation 

f*i  —  1  _w1  —  1  __  nt  —  1 
«i     "      «i     ""     *• 
But  the  most  probable  value  of  ei  is  given  by 

*!  —  *_*«  +  *,  —  * 

«1  «S  +  «,        ' 

while  the  preceding  equations  give 

nx  —  1  Wf  +  nt  —  2 
*!  =  *s  +  * 
These  distinctions  vanish  for  very  great  values  of  %,  wa,  nj. 
For  small  values  of  these  numbers,  they  are  important  Such 
facts  seem  to  indicate  that  the  consideration  of  the  most 
probable  division  of  energy  among  the  parts  of  a  system  does 
not  afford  a  convenient  foundation  for  the  study  of  thermody- 
namic analogies  in  the  case  of  systems  of  a  small  number  of 
degrees  of  freedom.  The  fact  that  a  certain  division  of  energy 
is  the  most  probable  has  really  no  especial  physical  importance, 
except  when  the  ensemble  of  possible  divisions  are  grouped  so 
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closely  together  that  the  most  probable  division  may  fairly 
represent  the  whole.  This  is  in  general  the  case,  to  a  very 
close  approximation,  when  n  is  enormously  great;  it  entirely 
fails  when  n  is  small. 

If  we  regard  d$/d*  as  corresponding  to  the  reciprocal  of 
temperature,  or,  in  other  words,  de/d<f>  as  corresponding  to 
temperature,  <£  will  correspond  to  entropy.  It  has  been  defined 
as  log  (d  V/<U).  In  the  considerations  on  which  its  definition 
is  founded,  it  is  therefore  very  similar  to  log  V.  We  have 
seen  that  d<f>/d\og  V  approaches  the  value  unity  when  n  is 
very  great.* 

To  form  a  differential  equation  on  the  model  of  the  thermo- 
dynamic equation  (482),  in  which  de/d<f>  shall  take  the  place 
of  temperature,  and  <f>  of  entropy,  we  may  write 

or  <*♦-  g* +  £<*,  + £*,  +  «..  (490) 

With  respect  to  the  differential  coefficients  in  the  last  equa- 
tion, which  corresponds  exactly  to  (482)  solved  with  respect 
to  dy,  we  have  seen  that  their  average  values  in  a  canonical 
ensemble  are  equal  to  I/O,  and  the  averages  of  A1/%i  ^3/9, 
etc.f  We  have  also  seen  that  dejcty  (or  d(f>/de)  has  relations 
to  the  most  probable  values  of  energy  in  parts  of  a  microca- 
nonical  ensemble.  That  (del  da^)ffmy  etc.,  have  properties 
somewhat  analogous,  may  be  shown  as  follows. 

In  a  physical  experiment,  we  measure  a  force  by  balancing  it 
against  another.  If  we  should  ask  what  force  applied  to  in- 
crease or  diminish  at  would  balance  the  action  of  the  systems, 
it  would  be  one  which  varies  with  the  different  systems.  But 
we  may  ask  what  single  force  will  make  a  given  value  of  ax 
the  most  probable,  and  we  shall  find  that  under  certain  condi- 
tions (de/dajfra  represents  that  force. 

•  See  Chapter  X,  pages  190, 121. 

t  See  Chapter  IX,  equations  (821),  (827). 
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To  make  the  problem  definite,  let  us  consider  a  system  con- 
sisting of  the  original  system  together  with  another  having 
the  coordinates  ax,  at,  etc-,  and  forces  At'9  AJy  etc-,  tending 
to  increase  those  coordinates.  These  are  in  addition  to  the 
forces  Av  A'v  etc,  exerted  by  the  original  system,  and  are  de- 
rived from  a  force-function  (—  «/)  by  the  equations 

*'— g»    -v— at'    eto- 

For  the  energy  of  the  whole  system  we  may  write 

E  as  c  +  cj  +  \miaf  +  1*4 a,1  +  etc, 

and  for  the  extension-in-phase  of  the  whole  system  within  any 
limits 

/  . .  •  /  dpi  . . .  dqudaxmxdaxda%mtdat  . .  . 

or  /  *  *  *  i  **<Z"&H9l|i<fai<fai)fts<&*s  •  •  •> 

or  again  L.l  el^  dEdaxmxdald(t%7ntdat . .  ., 

since  de  =  (JE,  when  aXl  au  a*,  «t,  etc.,  are  constant.  If  the 
limits  are  expressed  by  E  and  E  +  <2E,  ax  and  ax  +  dax ,  ax  and 
Oj  +  dax ,  etc,  the  integral  reduces  to 

tr  dEidaiVtida^da^fi^darg.  •  • 

The  values  of  ax,  o^,  a,,  o^,  etc.,  which  make  this  expression 
a  maximum  for  constant  values  of  the  energy  of  the  whole 
system  and  of  the  differentials  <2E,  da19  dav  etc.,  are  what  may 
be  called  the  most  probable  values  of  ax ,  ^ ,  etc.,  in  an  ensem- 
ble in  which  the  whole  system  is  distributed  microcanonically. 
To  determine  these  values  we  have 

when  d(e  +  c/  +  ^maf  +  \mtaf  +  etc)  =  0. 

That  is,  d+  =  0, 
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when 

(^  )  ^  +  (  Ai" )     *h  — -*i'*H  +  et0*  +  «i«i&h  +  etc.  =s  0. 

This  requires  a%  =  0,    a,  =  0,    etc., 

-      (£)M-«.  (£)*.-*•  * 

This  shows  that  for  any  given  values  of  E,  o^  a,,  etc. 
( -3—  )  ,  (  j-  )  >  ete->  represent  the  forces  (in  the  gen- 
eralized sense)  which  the  external  bodies  would  have  to  exert 
to  make  these  values  of  d^,  aa,  etc.,  the  most  probable  under 
the  conditions  specified.  When  the  differences  of  the  external 
forces  which  are  exerted  by  the  different  systems  are  negli- 
gible,—  (de/da^^ai  etc.,  represent  these  forces. 

It  is  certainly  in  the  quantities  relating  to  a  canonical 
ensemble,  e,  6,  rjy  Aly  etc.,  ax,  etc.,  that  we  find  the  most 
complete  correspondence  with  the  quantities  of  the  thermody- 
namic equation  (482).  Yet  the  conception  itself  of  the  canon- 
ical ensemble  may  seem  to  some  artificial,  and  hardly  germane 
to  a  natural  exposition  of  the  subject;  and  the  quantities 

*'  ;risiF*  l°s  r>  *?„»  etc->  «i> etc-> orc>  ^  *(s0*.f 

etc.,  al9  etc.,  which  are  closely  related  to  ensembles  of  constant 
energy,  and  to  average  and  most  probable  values  in  such 
ensembles,  and  most  of  which  are  defined  without  reference 
to  any  ensemble,  giay  appear  the  most  natural  analogues  of 
the  thermodynamic  quantities. 

In  regard  to  the  naturalness  of  seeking  analogies  with  the 
thermodynamic  behavior  of  bodies  in  canonical  or  microca- 
nonical  ensembles  of  systems,  much  will  depend  upon  how  we 
approach  the  subject,  especially  upon  the  question  whether  we 
regard  energy  or  temperature  as  an  independent  variable. 

It  is  very  natural  to  take  energy  for  an  independent  variable 
rather  than  temperature,  because  ordinary  mechanics  furnishes 
us  with  a  perfectly  defined  conception  of  energy,  whereas  the 
idea  of  something  relating  to  a  mechanical  system  and  corre- 
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sponding  to  temperature  is  a  notion  but  vaguely  defined*  Now 
if  the  state  of  a  system  is  given  by  its  energy  and  the  external 
coordinates,  it  is  incompletely  defined,  although  its  partial  defi- 
nition is  perfectly  clear  as  far  as  it  goes.  The  ensemble  of 
phases  microcanonically  distributed,  with  the  given  values  of 
the  energy  and  the  external  coordinates,  will  represent  the  im- 
perfectly defined  system  better  than  any  other  ensemble  or 
single  phase.  When  we  approach  the  subject  from  this  side, 
our  theorems  will  naturally  relate  to  average  values,  or  most 
probable  values,  in  such  ensembles. 

In  this  case,  the  choice  between  the  variables  of  (485)  or  of 
(489)  will  be  determined  partly  by  the  relative  importance 
which  is  attached  to  average  and  probable  values.  It  would 
seem  that  in  general  average  values  are  the  most  important,  and 
that  they  lend  themselves  better  to  analytical  transformations. 
This  consideration  would  give  the  preference  to  the  system  of 
variables  in  which  log  V  is  the  analogue  of  entropy.  Moreover, 
if  we  make  <f>  the  analogue  of  entropy,  we  are  embarrassed  by 
the  necessity  of  making  numerous  exceptions  for  systems  of 
one  or  two  degrees  of  freedom. 

On  the  other  hand,  the  definition  of  </>  may  be  regarded  as  a 
little  more  simple  than  that  of  log  V,  and  if  our  choice  is  deter- 
mined by  the  simplicity  of  the  definitions  of  the  analogues  of 
entropy  and  temperature,  it  would  seem  that  the  <f>  system 
should  have  the  preference.  In  our  definition  of  these  quanti- 
ties, Fwas  defined  first,  and  e*  derived  from  Vbj  differen- 
tiation. This  gives  the  relation  of  the  quantities  in  the  most 
simple  analytical  form.  Tet  so  far  as  the  notions  are  con- 
cerned, it  is  perhaps  more  natural  to  regard  Fas  derived  from 
«+  by  integration.  At  all  events,  e+  may  be  denned  inde- 
pendently of  Vj  and  its  definition  may  be  regarded  as  more 
simple  as  not  requiring  the  determination  of  the  zero  from 
which  V  is  measured,  which  sometimes  involves  questions 
of  a  delicate  nature.  In  fact,  the  quantity  e+  may  exist, 
when  the  definition  of  V  becomes  illusory  for  practical  pur- 
poses, as  the  integral  by  which  it  is  determined  becomes  infinite. 

The  case  is  entirely  different,  when  we  regard  the  tempera- 
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ture  as  an  independent  variable,  and  we  have  to  consider  a 
system  which  is  described  as  having  a  certain  temperature  and 
certain  values  for  the  external  coordinates.  Here  also  the 
state  of  the  system  is  not  completely  defined,  and  will  be 
better  represented  by  an  ensemble  of  phases  than  by  any  single 
phase.  What  is  the  nature  of  such  an  ensemble  as  will  best 
represent  the  imperfectly  defined  state  ? 

When  we  wish  to  give  a  body  a  certain  temperature,  we 
place  it  in  a  bath  of  the  proper  temperature,  and  when  we 
regard  what  we  call  thermal  equilibrium  as  established,  we  say 
that  the  body  has  the  same  temperature  as  the  bath.  Per- 
haps we  place  a  second  body  of  standard  character,  which  we 
call  a  thermometer,  in  the  bath,  and  say  that  the  first  body, 
the  bath,  and  the  thermometer,  have  all  the  same  temperature. 

But  the  body  under  such  circumstances,  as  well  as  the 
bath,  and  the  thermometer,  even  if  they  were  entirely  isolated 
from  external  influences  (which  it  is  convenient  to  suppose 
in  a  theoretical  discussion),  would  be  continually  changing  in 
phase,  and  in  energy  as  well  as  in  other  respects,  although 
our  means>  of  observation  are  not  fine  enough  to  perceive 
these  variations. 

The  series  of  phases  through  which  the  whole  system  runs 
in  the  course  of  time  may  not  be  entirely  determined  by  the 
energy,  but  may  depend  on  the  initial  phase  in  other  respects. 
In  such  cases  the  ensemble  obtained  by  the  microcanonical 
distribution  of  the  whole  system,  which  includes  all  possible 
time-ensembles  combined  in  the  proportion  which  seems  least 
arbitrary,  will  represent  better  than  any  one  time-ensemble 
the  effect  of  the  bath.  Indeed  a  single  time-ensemble,  when 
it  is  not  also  a  microcanonical  ensemble,  is  too  ill-defined  a 
notion  to  serve  the  purposes  of  a  general  discussion.  We 
will  therefore  direct  our  attention,  when  we  suppose  the  body 
placed  in  a  bath,  to  the  microcanonical  ensemble  of  phases 
thus  obtained. 

If  we  now  suppose  the  quantity  of  the  substance  forming 
the  bath  to  be  increased,  the  anomalies  of  the  separate  ener- 
gies of  the  body  and  of  the  thermometer  in  the  microcanonical 
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ensemble  will  be  inereased,  but  not  without  limit  The  anom- 
alies of  the  energy  of  the  bath,  considered  in  comparison  with 
its  whole  energy,  diminish  indefinitely  as  the  quantity  of  the 
bath  is  increased,  and  become  in  a  sense  negligible,  when 
the  quantity  of  the  bath  is  sufficiently  increased.  The 
ensemble  of  phases  of  the  body,  and  of  the  thermometer, 
approach  a  standard  form  as  the  quantity  of  the  bath  is  in- 
definitely increased.  This  limiting  form  is  easily  shown  to  be 
what  we  have  described  as  the  canonical  distribution. 

Let  us  write  €  for  the  energy  of  the  whole  system  consisting 
of  the  body  first  mentioned,  the  bath,  and  the  thermometer 
(if  any),  and  let  us  first  suppose  this  system  to  be  distributed 
canonkally  with  the  modulus  ©.    We  have  by  (206) 


(•-^-••g. 


and  since  c,m?6, 

c6     n  <K 

If  we  write  Ac  for  the  anomaly  of  mean  square,  we  hare 


(At)»«<s-y. 
If  we  set  Ae  =  -^Ae, 

A©  will  represent  approximately  the  increase  of  6  which 
would  produce  an  increase  in  the  average  value  of  the  energy 
equal  to  its  anomaly  of  mean  square.  Now  these  equations 
give 


which  shows  that  we  may  diminish  A©  indefinitely  by  increas- 
ing the  quantity  of  the  bath. 

Now  our  canonical  ensemble  consists  of  an  infinity  of  micro* 
canonical  ensembles,  which  differ  only  in  consequence  of  the 
different  values  of  the  energy  which  is  constant  in  each.  If 
we  consider  separately  the  phases  of  the  first  body  which 
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occur  in  the  canonical  ensemble  of  the  whole  system,  these 
phases  will  form  a  canonical  ensemble  of  the  same  modulus. 
This  canonical  ensemble  of  phases  of  the  first  body  will  con- 
sist of  parts  which  belong  to  the  different  microcanonical 
ensembles  into  which  the  canonical  ensemble  of  the  whole 
system  is  divided. 

Let  us  now  imagine  that  the  modulus  of  the  principal  ca- 
nonical ensemble  is  increased  by  2  A  ©,  and  its  average  energy 
by  2  A  e.  The  modulus  of  the  canonical  ensemble  of  the 
phases  of  the  first  body  considered  separately  will  be  increased 
by  2  A  8.  We  may  regard  the  infinity  of  microcanonical  en- 
sembles into  which  we  have  divided  the  principal  canonical 
ensemble  as  each  having  its  energy  increased  by  2  Ac.  Let 
us  see  how  the  ensembles  of  phases  of  the  first  body  con- 
tained in  these  microcanonical  ensembles  are  affected*  We 
may  assume  that  they  will  all  be  affected  in  about  the  same 
way,  as  all  the  differences  which  come  into  account  may  be 
treated  as  small.  Therefore,  the  canonical  ensemble  formed  by 
taking  them  together  will  also  be  affected  in  the  same  way. 
But  we  know  how  this  is  affected.  It  is  by  the  increase  of 
its  modulus  by  2  A  8,  a  quantity  which  vanishes  when  the 
quantity  of  the  bath  is  indefinitely  increased. 

In  the  case  of  an  infinite  bath,  therefore,  the  increase  of  the 
energy  of  one  of  the  microcanonical  ensembles  by  2Ae,  pro- 
duces a  vanishing  effect  on  the  distribution  in  energy  of  the 
phases  of  the  first  body  which  it  contains.  But  2Ae  is  more 
than  the  average  difference  of  energy  between  the  micro- 
canonical  ensembles.  The  distribution  in  energy  of  these 
phases  is  therefore  the  same  in  the  different  microcanonical 
ensembles,  and  must  therefore  be  canonical,  like  that  of  the 
ensemble  which  they  form  when  taken  together.* 

*  In  order  to  appreciate  the  above  reasoning,  it  should  be  understood  that 
the  differences  of  energy  which  occur  in  the  canonical  ensemble  of  phases  of 
the  first  body  are  not  here  regarded  as  vanishing  quantities.  To  fix  one's 
ideas,  one  may  imagine  that  he  has  the  fineness  of  perception  to  make  these 
differences  seem  large.  The  difference  between  the  part  of  these  phases 
which  belong  to  one  microcanonical  ensemble  of  the  whole  system  and  the 
part  which  belongs  to  another  would  still  be  imperceptible,  when  the  quan- 
tity of  the  bath  is  sufficiently  increased. 
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As  a  general  theorem,  the  conclusion  may  be  expressed  in 
the  woids:  —  If  a  system  of  a  great  number  of  degrees  of 
freedom  is  microcanonically  distributed  in  phase,  any  very 
small  part  of  it  may  be  regarded  as  canonically  distributed.4 

It  would  seem,  therefore,  that  a  canonical  ensemble  of 
phases  is  what  best  represents,  with  the  precision  necessary 
for  exact  mathematical  reasoning,  the  notion  of  a  body  with 
a  given  temperature,  if  we  conceive  of  the  temperature  as  the 
state  produced  by  such  processes  as  we  actually  use  in  physics 
to  produce  a  given  temperature.  Since  the  anomalies  of  the 
body  increase  with  the  quantity  of  the  bath,  we  can  only  get 
rid  of  all  that  is  arbitrary  in  the  ensemble  of  phases  which  is 
to  represent  the  notion  of  a  body  of  a  given  temperature  by 
making  the  bath  infinite,  which  brings  us  to  the  canonical 
distribution. 

A  comparison  of  temperature  and  entropy  with  their. ana- 
logues in  statistical  mechanics  would  be  incomplete  without  a 
consideration  of  their  differences  with  respect  to  units  and 
zeros,  and  the  numbers  used  for  their  numerical  specification. 
If  we  apply  the  notions  of  statistical  mechanics  to  such  bodies 
as  we  usually  consider  in  thermodynamics,  for  which  the 
kinetic  energy  is  of  the  same  order  of  magnitude  as  the  unit 
cf  energy,  but  the  number  of  degrees  of  freedom  is  enormous, 
the  values  of  ©,  de/dlogV,  and  d€/d<f>  will  be  of  the  same 
order  of  magnitude  as  1/w,  and  the  variable  part  of  ij,  log  V, 
and  <f>  will  be  of  the  same  order  of  magnitude  as  n.f  If  these 
quantities,  therefore,  represent  in  any  sense  the  notions  of  tem- 
perature and  entropy,  they  will  nevertheless  not  be  measured 
in  units  of  the  usual  order  of  .magnitude,  —  a  fact  which  must 
be  borne  in  mind  in  determining  what  magnitudes  may  be 
regarded  as  insensible  to  human  observation. 

Now  nothing  prevents  our  supposing  energy  and  time  in 
our  statistical  formulae  to  be  measured  in  such  units  as  may 

*  It  is  assumed— and  without  this  assumption  the  theorem  would  hare 
no  distinct  meaning — that  the  part  of  the  ensemble  considered  may  be 
regarded  as  haying  separate  energy. 

t  8ee  equations  (124),  (288),  (289),  and  (814) ;  also  page  108. 
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be  convenient  for  physical  purposes.  Bat  when  these  units 
have  been  chosen,  the  numerical  values  of  ©,  ck/ilogF, 
de/dfa  rjy  log  V,  <f>,  are  entirely  determined,4  and  in  order  to 
compare  them  with  temperature  and  entropy,  the  numerical 
values  of  which  depend  upon  an  arbitrary  unit,  we  must  mul- 
tiply all  values  of  ©,  de/d  log  V,  de/d<f>  by  a  constant  (JT), 
and  divide  all  values  of  rj,  log  V,  and  ^  by  the  same  constant. 
This  constant  is  the  same  for  all  bodies,  and  depends  only  on 
the  units  of  temperature  and  energy  which  we  employ.  For 
ordinary  units  it  is  of  the  same  order  of  magnitude  as  the 
numbers  of  atoms  in  ordinary  bodies. 

We  are  not  able  to  determine  the  numerical  value  of  JT, 
as  it  depends  on  the  number  of  molecules  in  the  bodies  with 
which  we  experiment  To  fix  our  ideas,  however,  we  may 
seek  an  expression  for  this  value,  based  upon  very  probable 
assumptions,  which  will  show  how  we  would  naturally  pro- 
ceed to  its  evaluation,  if  our  powers  of  observation  were  fine 
enough  to  take  cognizance  of  individual  molecules. 

If  the  unit  of  mass  of  a  monatomic  gas  contains  v  atoms, 
and  it  may  be  treated  as  a  system  of  Zv  degrees  of  free- 
dom, which  seems  to  be  the  case,  we  have  for  canonical 
distribution 


2  =  1*  <491> 


If  we  write  T  for  temperature,  and  ew  for  the  specific  heat  of 
the  gas  for  constant  volume  (or  rather  the  limit  toward 
which  this  specific  heat  tends,  as  rarefaction  is  indefinitely 
increased),  we  have 

since  we  may  regard  the  energy  as  entirely  kinetic.  We  may 
set  the  €p  of  this  equation  equal  to  the  ep  of  the  preceding, 

*  The  unit  of  time  only  affects  the  last  three  quantities,  and  these  only 
by  an  additive  constant,  which  disappears  (with  the  additive  constant  of 
entropy),  when  differences  of  entropy  are  compared  with  their  statistical 
analogues.    See  page  19. 
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where  indeed  the  individual  values  of  which  the  average  is 
taken  would  appear  to  human  observation  as  identical*  This 
gives 

dT~  3r' 

whence  2T=37*  (493) 

a  value  recognized  by  physicists  as  a  constant  independent  of 
the  kind  of  monatomic  gas  considered. 

We  may  also  express  the  value  of  IT  in  a  somewhat  different 
form,  which  corresponds  to  the  indirect  method  by  which 
physicists  are  accustomed  to  determine  the  quantity  c9.  The 
kinetic  energy  due  to  the  motions  of  the  centers  of  mass  of 
the  molecules  of  a  mass  of  gas  sufficiently  expanded  is  easily 
shown  to  be  equal  to 

where  p  and  v  denote  the  pressure  and  volume.  The  average 
value  of  the  same  energy  in  a  canonical  ensemble  of  such 
a  mass  of  gas  is 

where  v  denotes  the  number  of  molecules  in  the  gas.  Equat- 
ing these  values,  we  have 

j>*  =  8r,  (494) 

,  1       B     pv  ,.-_ 

whence  K^T^^T*  (49^ 

Now  the  laws  of  Boyle,  Charles,  and  Avogadro  may  be  ex- 
pressed by  the  equation 

pv  =  AvT,  (496) 

where  A  is  a  constant  depending  only  on  the  units  in  which 
energy  and  temperature  are  measured.  1  /  JT,  therefore,  might 
be  called  the  constant  of  the  law  of  Boyle,  Charles,  and 
Avogadro  as  expressed  with  reference  to  the  true  number  of 
molecules  in  a  gaseous  body. 

Since  such  numbers  are  unknown  to  us,  it  is  more  conven- 
ient to  express  the  law  with  reference  to  relative  values.  If 
we  denote  by  M  the  so-called  molecular  weight  of  a  gas,  that 
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is,  a  number  taken  from  a  table  of  numbers  proportional  to 
the  weights  of  various  molecules  and  atoms,  but  having  one 
of  the  values,  perhaps  the  atomic  weight  of  hydrogen,  arbi- 
trarily made  unity,  the  law  of  Boyle,  Charles,  and  Avogadro 
may  be  written  in  the  more  practical  form 

pv  =  A'Tjp,  (497) 

where  II  is  a  constant  and  m  the  weight  of  gas  considered. 
It  is  evident  that  1  IT  is  equal  to  the  product  of  the  constant 
of  the  law  in  this  form  and  the  (true)  weight  of  an  atom  of 
hydrogen,  or  such  other  atom  or  molecule  as  may  be  given 
the  value  unity  in  the  table  of  molecular  weights. 

In  the  following  chapter  we  shall  consider  the  necessary 
modifications  in  the  theory  of  equilibrium,  when  the  quantity 
of  matter  contained  in  a  system  is  to  be  regarded  as  variable, 
or,  if  the  system  contains  more  than  one  kind  of  matter, 
when  the  quantities  of  the  several  kinds  of  matter  in  the 
system  are  to  be  regarded  as  independently  variable.  This 
will  give  us  yet  another  set  of  variables  in  the  statistical 
equation,  corresponding  to  those  of  the  amplified  form  of 
the  thermodynamic  equation. 
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CHAPTER  XV. 

SYSTEMS  COMPOSED  OF  MOLECULES. 

The  nature  of  material  bodies  is  such  that  especial  interest 
attaches  to  the  dynamics  of  systems  composed  of  a  great 
number  of  entirely  similar  particles,  or,  it  may  be,  of  a  great 
number  of  particles  of  several  kinds,  all  of  each  kind  being 
entirely  similar  to  each  other.  We  shall  therefore  proceed  to 
consider  systems  composed  of  such  particles,  whether  in  great 
numbers  or  otherwise,  and  especially  to  consider  the  statistical 
equilibrium  of  ensembles  of  such  systems.  One  of  the  varia- 
tions to  be  considered  in  regard  to  such  systems  is  a  variation 
in  the  numbers  of  the  particles  of  the  various  kinds  which  it 
contains,  and  the  question  of  statistical  equilibrium  between 
two  ensembles  of  such  systems  relates  in  part  to  the  tendencies 
of  the  various  kinds  of  particles  to  pass  from  the  one  to  the 
other. 

First  of  all,  we  must  define  precisely  what  is  meant  by 
statistical  equilibrium  of  such  an  ensemble  of  systems.  The 
essence  of  statistical  equilibrium  is  the  permanence  of  the 
number  of  systems  which  fall  within  any  given  limits  with 
respect  to  phase.  We  have  therefore  to  define  how  the  term 
"  phase  "  is  to  be  understood  in  such  cases.  If  two  phases  differ 
only  in  that  certain  entirely  similar  particles  have  changed 
places  with  one  another,  are  they  to  be  regarded  as  identical 
or  different  phases?  If  the  particles  are  regarded  as  indis- 
tinguishable, it  seems  in  accordance  with  the  spirit  of  the 
statistical  method  to  regard  the  phases  as  identical.  In  fact, 
it  might  be  urged  that  in  such  an  ensemble  of  systems  as  we 
are  considering  no  identity  is  possible  between  the  particles 
of  different  systems  except  that  of  qualities,  and  if  v  particles 
of  one  system  are  described  as  entirely  similar  to  one  another 
and  to  v  of  another  system,  nothing  remains  on  which  to  base 
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the  indentification  of  any  particular  particle  of  the  first  system 
with  any  particular  particle  of  the  second.  And  this  would 
be  true,  if  the  ensemble  of  systems  had  a  simultaneous 
objective  existence*  But  it  hardly  applies  to  the  creations 
of  the  imagination.  In  the  cases  which  we  have  been  con- 
sidering, and  in  those  which  we  shall  consider,  it  is  not  only 
possible  to  conceive  of  the  motion  of  an  ensemble  of  similar 
systems  simply  as  possible  cases  of  the  motion  of  a  single 
system,  but  it  is  actually  in  large  measure  for  the  sake  of 
representing  more  clearly  the  possible  cases  of  the  motion  of 
a  single  system  that  we  use  the  conception  of  an  ensemble 
of  systems.  The  perfect  similarity  of  several  particles  of  a 
system  will  not  in  the  least  interfere  with  the  identification 
of  a  particular  particle  in  one  case  with  a  particular  particle 
in  another.  The  question  is  one  to  be  decided  in  accordance 
with  the  requirements  of  practical  convenience  in  the  discus- 
sion of  the  problems  with  which  we  are  engaged. 

Our  present  purpose  will  often  require  us  to  use  the  terms 
phase,  den*ity4n-phasey  statistical  equilibrium,  and  other  con- 
nected terms  on  the  supposition  that  phases  are  not  altered 
by  the  exchange  of  places  between  similar  particles.  Some 
of  the  most  important  questions  with  which  we  are  concerned 
have  reference  to  phases  thus  defined.  We  shall  call  them 
phases  determined  by  generic  definitions,  or  briefly,  generio 
phases.  But  we  shall  also  be  obliged  to  discuss  phases  de- 
fined by  the  narrower  definition  (so  that  exchange  of  position 
between  similar  particles  is  regarded  as  changing  the  phase), 
which  will  be  called  phases  determined  by  specific  definitions, 
or  briefly,  specific  phases.  For  the  analytical  description  of 
a  specific  phase  is  more  simple  than  that  of  a  generic  phase. 
And  it  is  a  more  simple  matter  to  make  a  multiple  integral 
extend  over  all  possible  specific  phases  than  to  make  one  extend 
without  repetition  over  all  possible  generic  phases. 

It  is  evident  that  if  vu  vt . . .  v*  are  the  numbers  of  the  dif- 
ferent kinds  of  molecules  in  any  system,  the  number  of  specific 
phases  embraced  in  one  generic  phase  is  represented  by  the 
continued  product  \i\  \v%  •  •  •  [jfc*  mkI  the  coefficient  of  probabil- 


Digitized  by  VjOOQIC 


SYSTEMS  COMPOSED  OF  MOLECULES.  189 

ity  of  *  generic  phase  is  the  sum  of  the  probability-coefficients 
of  the  specific  phases  which  it  represents.  When  these  are 
equal  among  themselves,  the  probability-coefficient  of  the  gen- 
eric phase  is  equal  to  that  of  the  specific  phase  multiplied  by 
[i*i  |j^. . .  |*y  It  is  also  evident  that  statistical  equilibrium 
may  subsist  with  respect  to  generic  phases  without  statistical 
equilibrium  with  respect  to  specific  phases,  butnot  vice  v*r$a. 

Similar  questions  arise  where  one  particle  is  capable  of 
several  equivalent  positions.  Does  the  change  from  one  of 
these  positions  to  another  change  the  phase?  It  would  be 
most  natural  and  logical  to  make  it  affect  the  specific  phase, 
but  not  the  generic.  The  number  of  specific  phases  contained 
in  a  generic  phase  would  then  be  ^  tcx¥l .  . .  ^  */*,  where 
*!,...**  denote  the  numbers  of  equivalent  positions  belong- 
ing to  the  several  kinds  of  particles.  The  case  in  which  a  *  is 
infinite  would  then  require  especial  attention.  It  does  not 
appear  that  the  resulting  complications  in  the  formulae  would 
be  compensated  by  any  real  advantage.  The  reason  of  this  is 
that  in  problems  of  real  interest  equivalent  positions  of  a 
particle  will  always  be  equally  probable.  In  this  respect, 
equivalent  positions  of  the  same  particle  are  entirely  unlike 
the  [^different  ways  in  which  v  particles  may  be  distributed 
in  v  different  positions.  Let  it  therefore  be  understood  that 
in  spite  of  the  physical  equivalence  of  different  positions  of 
the  same  particle  they  are  to  be  considered  as  constituting  a 
difference  of  generic  phase  as  well  as  of  specific.  The  number 
of  specific  phases  contained  in  a  generic  phase  is  therefore 
always  given  by  the  product  \vx\v%  .  « *  |»y 

Instead  of  considering,  as  in  the  preceding  chapters,  en- 
sembles of  systems  differing  only  in  phase,  we  shall  now 
suppose  that  the  systems  constituting  an  ensemble  are  com- 
posed of  particles  of  various  kinds,  and  that  they  differ  not 
only  in  phase  but  also  in  the  numbers  of  these  particles  which 
they  contain.  The  external  coordinates  of  all  the  systems  in 
the  ensemble  are  supposed,  as  heretofore,  to  have  the  same 
value,  and  when  they  vary,  to  vary  together.  For  distinction, 
we  may  Call  such  an  ensemble  a  grand  ensemble,  and  one  in 
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which  the  systems  differ  only  in  phase  *  petit  ensemble.  A 
grand  ensemble  is  therefore  composed  of  a  multitude  of  petit 
ensembles.  The  ensembles  which  we  have  hitherto  discussed 
are  petit  ensembles. 

Let  v19 . . .  vh,  etc.,  denote  the  numbers  of  the  different 
kinds  of  particles  in  a  system,  e  its  energy,  and  jlf . .  .  ?,, 
jPi »  •  •  •  P*  its  coordinates  and  momenta.  If  the  particles  are  of 
the  nature  of  material  points,  the  number  of  coordinates  (n) 
of  the  system  will  be  equal  to  8^  .  .  •  +Zvh.  But  if  the  parti- 
cles are  less  simple  in  their  nature,  if  they  are  to  be  treated 
as  rigid  solids,  the  orientation  of  which  must  be  regarded,  or 
if  they  consist  each  of  several  atoms,  so  as  to  have  more  than 
three  degrees  of  freedom,  the  number  of  coordinates  of  the 
system  will  be  equal  to  the  sum  of  v1$  i>a,  etc.,  multiplied 
each  by  the  number  of  degrees  of  freedom  of  the  kind  of 
particle  to  which  it  relates. 

Let  us  consider  an  ensemble  in  which  the  number  of 
systems  haying  v1, . . .  vh  particles  of  the  several  kinds,  and 
having  values  of  their  coordinates  and  momenta  lying  between 
the  limits  qt  and  j3  +  dqx,  px  and  pt  +  dpl$  etc.,  is  represented 
by  the  expression 

Ne  e 

[^...[3         *Pl---*tn,  («8) 

where  N>  ft,  0,  fil9 . . .  fih  are  constants,  N  denoting  the  total 
number  of  systems  in  the  ensemble.     The  expression 

Ne  e  (499) 

evidently  represents  the  density-in-phase  of  the  ensemble 
within  the  limits  described,  that  is,  for  a  phase  specifically 
defined.    The  expression 

'  e  (500) 

Is-  ••la 
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is  therefore  the  probability-coefficient  for  a  phase  specifically 
defined.  This  has  evidently  the  same  value  for  all  the 
|»l . . .  \r±  phases  obtained  by  interchanging  the  phases  of 
particles  of  the  same  kind.  The  probability-coefficient  for  a 
generic  phase  will  be  |n . » « K  times  as  great,  vis^ 

«  •  (501) 

We  shall  say  that  such  an  ensemble  as  has  been  described 
is  canonically  dittribuUd,  and  shall  call  the  constant  8  its 
modulus.  It  is  evidently  what  we  have  called  a  grand  ensem- 
ble. The  petit  ensembles  of  which  it  is  composed  are 
canonically  distributed,  according  to  the  definitions  of  Chapter 
IV,  since  the  expression 

*b.*b  (602) 

is  constant  for  each  petit  ensemble.  The  grand  ensemble, 
therefore,  is  in  statistical  equilibrium  with  respect  to  specific 


If  an  ensemble,  whether  grand  or  petit,  is  identical  so  far 
as  generic  phases  are  concerned  with  one  canonically  distrib- 
uted, we  shall  say  that  its  distribution  is  canonical  with 
respect  to  generic  phases.  Such  an  ensemble  is  evidently  in 
statistical  equilibrium  with  respect  to  generic  phases,  although 
it  may  not  be  so  with  respect  to  specific  phases. 

If  we  write  H  for  the  index  of  probability  of  a  generic  phase 
in  a  grand  ensemble,  we  have  for  the  case  of  canonical 
distribution 

H  =  o  +  Ml„.. .  +  /*»*»-«  (603) 

99 

It  will  be  observed  that  the  H  is  a  linear  function  of  e  and 
vv  . . .  vh ;  also  that  whenever  the  index  of  probability  of 
generic  phases  in  a  grand  ensemble  is  a  linear  function  of 
€,  vv  .  .  .  vk,  the  ensemble  is  canonically  distributed  with 
respect  to  generic  phases. 
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The  constant  Q  we  may  regard  as  determined  by  the 
equation 

^-^-^^•/^la-'la     *"•*"     (604) 

or 

4  *  =  ^...%Pk*—?—f...fe  *dPl...dqm,    (505) 

where  the  multiple  sum  indicated  by  2Fl  • . .  2n  includes  all 
terms  obtained  by  giving  to  each  of  the  symbols  vi . . .  vh  all 
integral  values  from  zero  upward,  and  the  multiple  integral 
(which  is  to  be  evaluated  separately  for  each  term  of  the 
multiple  sum)  is  to  be  extended  over  all  the  (specific)  phases 
of  the  system  having  the  specified  numbers  of  particles  of  the 
various  kinds.    The  multiple  integral  in  the  last  equation  is 

Jt 
what  we  have  represented  by  e  •.    See  equation  (92).    We 
may  therefore  write 

Mi»i . . .  innr+ 

—  b • 

a  •  =  2r1...2rIn , (506) 

It  should  be  observed  that  the  summation  includes  a  term 
in  which  all  the  symbols  vt . .  .  vh  have  the  value  zero.  We 
must  therefore  recognize  in  a  certain  sense  a  system  consisting 
of  no  particles,  which,  although  a  barren  subject  of  study  in 
itself,  cannot  well  be  excluded  as  a  particular  case  of  a  system 
of  a  variable  number  of  particles.  In  this  case  e  is  constant, 
and  there  are  no  integrations  to  be  performed.  We  have 
therefore* 

_*       -I 
e  e  =s  e  e,    i.  e.f    $  =  c. 

*  This  conclusion  may  appear  a  little  strained.  The  original  definition 
of  f  may  not  be  regarded  as  fairly  applying  to  systems  of  no  degrees  of 
freedom.  We  may  therefore  prefer  to  regard  these  equations  as  denning 
t  in  this  case. 
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The  value  of  tp  is  of  course  zero  in  this  case.  But  the 
value  of  eq  contains  an  arbitrary  constant,  which  is  generally 
determined  by  considerations  of  convenience,  so  that  eq  and  e 
do  not  necessarily  vanish  with  vt , .  • .  vh. 

Unless  —  O  has  a  finite  value,  our  formulae  beoome  illusory. 
We  have  already,  in  considering  petit  ensembles  canonically 
distributed,  found  it  necessary  to  exclude  cases  in  which  —  ^ 
has  not  a  finite  value.*  The  same  exclusion  would  here 
make  —  ^  finite  for  any  finite  values  of  vx . . .  vk.  This  does 
not  necessarily  make  a  multiple  series  of  the  form  (506)  finite. 
We  may  observe,  however,  that  if  for  all  values  of  vx  . . .  vh 

-^Sc0  +  clylf...  +  ehvk,  (607) 

where  <?0,  cv  . . .  ch  are  constants  or  functions  of  0, 


t.  *« 


t.  «., 


fl                                   CorfOh+Ci)^ . . .  +(/»A+«jk)rjk 

a    8  <  5            5    e 

e 

6       -  **i  •  * '  *n 

In-  •  •[** 

0         <  6      2          i 

838              "l       |*i 

*  • 

••*»   bi 

a       c0    #H-ta 

-?<*>  +  «     *    . 

..  +  «    e 

The  value  of  —  XI  will  therefore  be  finite,  when  the  condition 
(607)  is  satisfied.  If  therefore  we  assume  that  —  XI  is  finite, 
we  do  not  appear  to  exclude  any  cases  which  are  analogous  to 
those  of  nature,  t 

The  interest  of  the  ensemble  which  has  been  described  lies 
in  the  fact  that  it  may  be  in  statistical  equilbrium,  both  in 

•  See  Chapter  IV,  page  86. 

t  If  the  external  coordinates  determine  a  certain  rolume  within  which  the 
system  is  confined,  the  contrary  of  (607)  would  imply  that  we  could  obtain 
an  infinite  amount  of  work  by  crowding  an  infinite  quantity  of  matter  into  a 
finite  rolume. 
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respect  to  exchange  of  energy  and  exchange  of  particles,  with 
other  grand  ensembles  canonically  distributed  and  haying  the 
same  values  of  8  and  of  the  coefficients  jjlv  ^,  etc.,  when  the 
circumstances  are  such  that  exchange  of  energy  and  of 
particles  are  possible,  and  when  equilibrium  would  not  sub- 
sist, were  it  not  for  equal  values  of  these  constants  in  the  two 
ensembles. 

With  respect  to  the  exchange  of  energy,  the  case  is  exactly 
the  same  as  that  of  the  petit  ensembles  considered  in  Chapter 
IV,  and  needs  no  especial  discussion.  The  question  of  ex- 
change of  particles  is  to  a  certain  extent  analogous,  and  may 
be  treated  in  a  somewhat  similar  manner.  Let  us  suppose 
that  we  have  two  grand  ensembles  canonically  distributed 
with  respect  to  specific  phases,  with  the  same  value  of  the 
modulus  and  of  the  coefficients  /^  . . .  ph ,  and  let  us  consider 
the  ensemble  of  all  the  systems  obtained  by  combining  each 
system  of  the  first  ensemble  with  each  of  the  second. 

The  probability-coefficient  of  a  generic  phase  in  the  first 
ensemble  may  be  expressed  by 

e  e  (609) 

The  probability-coefficient  of  a  specific  phase  will  then  be 
expressed  by 

o/+mi*i/.--+>W-«/ 

e 


K 


(510) 


since  each  generic  phase  comprises  1 1^ . . .  (j^  specific  phases. 
In  the  second  ensemble  the  probability-coefficients  of  the 
generic  and  specific  phases  will  be 

e  e  ,  (511) 


"*  W' M> (612) 

iyi  •  •  • \yh 
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The  probability-coefficient  of  a  generic  phase  in  the  third 
ensemble,  which  consists  of  systems  obtained  by  regarding 
each  system  of  the  first  ensemble  combined  with  each  of  the 
second  as  forming  a  system,  will  be  the  product  of  the  proba- 
bility-coefficients of  the  generic  phases  of  the  systems  com- 
bined, and  will  therefore  be  represented  by  the  formula 

e  e  (513) 

where  fl'"  =  O'  +  fl",  c"'  =  4  +  c",  vxw  =  *'  <r  *",  etc-  It 
will  be  observed  that  V,  etc.,  represent  the  numbers  of 
particles  of  the  various  kinds  in  the  third  ensemble,  and  e"' 
its  energy ;  also  that  Om  is  a  constant.  The  third  ensemble 
is  therefore  canonically  distributed  with  respect  to  generic 


If  all  the  systems  in  the  same  generic  phase  in  the  third 
ensemble  were  equably  distributed  among  the  \v\llf . . .  \vhtn  spe- 
cific phases  which  are  comprised  in  the  generic  phase,  the  prob- 
ability-coefficient of  a  specific  phase  would  be 

laC-laC (514) 

In  fact,  however,  the  probability-coefficient  of  any  specific 
phase  which  occurs  in  the  third  ensemble  is 

4  * 


which  we  get  by  multiplying  the  probability-coefficients  of 
specific  phases  in  the  first  and  second  ensembles.  The  differ- 
ence between  the  formulae  (514)  and  (515)  is  due  to  the  fact 
that  the  generic  phases  to  which  (513)  relates  include  not 
only  the  specific  phases  occurring  in  the  third  ensemble  and 
having  the  probability-coefficient  (515),  but  also  all  the 
specific  phases  obtained  from  these  by  interchange  of  similar 
particles  between  two  combined  systems.    Of  these  the  proba- 
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bility-coefficient  is  evidently  zero,  as  they  do  not  occur  in  the 
ensemble. 

Now  this  third  ensemble  is  in  statistical  equilibrium,  with 
respect  both  to  specific  and  generic  phases,  since  the  ensembles 
from  which  it  is  formed  are  so.  This  statistical  equilibrium 
is  not  dependent  on  the  equality  of  the  modulus  and  the  co-effi- 
cients fix, . . .  fik  in  the  first  and  second  ensembles.  It  depends 
only  on  the  fact  that  the  two  original  ensembles  were  separ- 
ately in  statistical  equilibrium,  and  that  there  is  no  interaction 
between  them,  the  combining  of  the  two  ensembles  to  form  a 
third  being  purely  nominal,  and  involving  no  physical  connec- 
tion. This  independence  of  the  systems,  determined  physically 
by  forces  which  prevent  particles  from  passing  from  one  sys- 
tem to  the  other,  or  coming  within  range  of  each  other's  action, 
is  represented  mathematically  by  infinite  values  of  the  energy 
for  particles  in  a  space  dividing  the  systems.  Such  a  space 
may  be  called  a  diaphragm. 

If  we  now  suppose  that,  when  we  combine  the  systems  of 
the  two  original  ensembles,  the  forces  are  so  modified  that  the 
energy  is  no  longer  infinite  for  particles  in  all  the  space  form- 
ing the  diaphragm,  but  is  diminished  in  a  part  of  this  space, 
so  that  it  is  possible  for  particles  to  pass  from  one  system 
to  the  other,  this  will  involve  a  change  in  the  function  e"' 
which  represents  the  energy  of  the  combined  systems,  and  the 
equation  d"  =  e'  +  c"  will  no  longer  hold.  Now  if  the  co- 
efficient of  probability  in  the  third  ensemble  were  represented 
by  (513)  with  this  new  function  c"',  we  should  have  statistical 
equilibrium,  with  respect  to  generic  phases,  although  not  to 
specific.  But  this  need  involve  only  a  trifling  change  in  the 
distribution  of  the  third  ensemble,*  a  change  represented  by 
the  addition  of  comparatively  few  systems  in  which  the  trans- 
ference of  particles  is  taking  place  to  the  immense  number 

*  It  wiU  be  observed  that,  to  iar  as  the  distribution  is  concerned,  rery 
large  and  infinite  rallies  of  c  (for  certain  phases)  amount  to  nearly  the  same 
thing,— one  representing  the  total  and  the  other  the  nearly  total  exclusion 
of  the  phases  in  question.  An  infinite  change,  therefore,  in  the  value  of  « 
(for  certain  phases)  may  represent  a  vanishing  change  in  the  distribution. 
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obtained  by  combining  the,  two  original  ensembles.  The 
difference  between  the  ensemble  which  would  be  in  statistical 
equilibrium,  and  that  obtained  by  combining  the  two  original 
ensembles  may  be  diminished  without  limit,  while  it  is  still 
possible  for  particles  to  pass  from  one  system  to  another.  In 
this  sense  we  may  say  that  the  ensemble  formed  by  combining 
the  two  given  ensembles  may  still  be  regarded  as  in  a  state  of 
(approximate)  statistical  equilibrium  with^'respect  to  generic 
phases,  when  it  has  been  made  possible  f  0r  particles  to  pass 
between  the  systems  combined,  and  when  statistical  equilibrium 
for  specific  phases  has  therefore  entirely  ceased  to  exist,  and 
when  the  equilibrium  for  generic  phases  would  also  have 
entirely  ceased  to  exist,  if  the  given  ensembles  had  not  been 
canonically  distributed,  with  respect  to  generic  phases,  with 
the  same  values  of  ©  and  nv  . . .  fik. 

It  is  evident  also  that  considerations  of  this  kind  will  apply 
separately  to  the  several  kinds  of  particles.  We  may  diminish 
the  energy  in  the  space  forming  the  diaphragm  for  one  kind  of 
particle  and  not  for  another.  This  is  the  mathematical  ex- 
pression for  a  "  semipermeable"  diaphragm.  The  condition 
necessary  for  statistical  equilibrium  where  the  diaphragm  is 
permeable  only  to  particles  to  which  the  suffix  (  )x  relates 
will  be  fulfilled  when  ^  and  8  have  the  same  values  in  the 
two  ensembles,  although  the  other  coefficients  y*,,  etc.,  may  be 
different. 

This  important  property  of  grand  ensembles  with  canonical 
distribution  will  supply  the  motive  for  a  more  particular  ex- 
amination of  the  nature  of  such  ensembles,  and  especially  of 
the  comparative  numbers  of  systems  in  the  several  petit  en- 
sembles which  make  up  a  grand  ensemble,  and  of  the  average 
values  in  the  grand  ensemble  of  some  of  the  most  important 
quantities,  and  of  the  average  squares  of  the  deviations  from 
these  average  values. 

The  probability  that  a  system  taken  at  random  from  a 
grand  ensemble  canonically  distributed  will  have  exactly 
vv . . .  v%  particles  of  the  various  kinds  is  expressed  by  the 
multiple  integral 
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f"f  la.* la     *-*•  (616) 

phtm 

n+/t1r1...+Wy-» 

or  *  8  .  (517) 

la--  *la 

This  may  be  called  the  probability  of  the  petit  ensemble 
(?i*  •  •  •  "*)•  The  sum  of  all  such  probabilities  is  evidently 
unity.    That  is, 

X, . . .  Xk  *-, ^ =  1,  (518) 

which  agrees  with  (506). 

The  average  value  in  the  grand  ensemble  of  any  quantity 
u,  is  given  by  the  formula 

•a         g 

u  =  lrt...X,hf...fweh       ^        dPl . .  .<*?..  (519) 


If  it  is  a  function  of  Flf . . .  v»  alone, t.  *.,  if  it  has  the  same 
value  in  all  systems  of  any  same  petit  ensemble,  the  formula 
reduces  to 


ue 


«  =  **•••** — la---la. —  <520> 

Again,  if  we  write  ujgnad  and  ufp^  to  distinguish  averages  in 
the  grand  and  petit  ensembles,  we  shall  have 

In  this  chapter,  in  which  we  are  treating  of  grand  en- 
sembles, u  will  always  denote  the  average  for  a  grand  en- 
semble. In  the  preceding  chapters,  u  has  always  denoted 
the  average  for  a  petit  ensemble. 
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Equation  (605),  which  we  repeat  in  a  slightly  different 
form,  viz., 

<  •  =  1^...%hf...fl- j dpi...^.,  (522) 

showB  that  A  is  a  function  of  8  and  ^  ...  pk;  also  of  the 
external  coordinates  Oj,  o^  etc,  which  are  involved  implicitly 
in  6.  If  we  differentiate  the  equation  regarding  all  these 
quantities  as  variable,  we  have 


(-?+**)- 


an 


[n  •  •  •  K 


+  e 


phaata 


+  etc. 


all  jj 


-  etc.  (523) 

n 
If  we  multiply  this  equation  by  ee,  and  set  as  usual  Av  A^ 
etc.,  for  —  cle/cfaj,  —  dejda^,  etc.,  we  get  in  virtue  of  the  law 
expressed  by  equation  (519), 

da  t  o  .  _        c» ,   -        t     .     - 

+  ^*  +  *5*  +  eto.|  (524) 
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that  is, 

Since  equation  (608)  gives 

the  preceding  equation  may  be  written 

dQ  a  KdB  -  ln^Mi  -  S lid*.  (527) 

Again,  equation  (526)  gives 

<*Q  + Jim^i  +  Sn^i-de=x#dH  +  HJ§.  (528) 
Eliminating  <ftl  from  these  equations,  we  get 

£  =  -  #rf H  +  S^^  -  S -*!  Ah.  (529) 

If  we  set  t=i  +  eH,  (690) 

#*  =  «fc  +  #<*H  +  Hd&,  (581) 

we  have  c»  =  H*  +  S^d^  — 3Jt*i.  (682) 

The  corresponding  thermodynamic  equations  are 

*=r<^  +  Vi*»i-34*if  (583) 

^  =  e-r^  (534) 

d^rx-^r+a^dnH-S^Ah.  (535) 

These  are  derived  from  the  thermodynamic  equations  (114) 
and  (117)  by  the  addition  of  the  terms  neoessary  to  take  ao- 
oount  of  variation  in  the  quantities  (m^  04,  etc.)  of  the 
several  substances  of  which  a  body  is  composed.  The  cor- 
respondence of  the  equations  is  most  perfect  when  the  com- 
ponent substances  are  measured  in  such  units  that  mv  114, 
etc.,  are  proportional  to  the  numbers  of  the  different  kinds 
of  molecules  or  atoms.  The  quantities  ^p  Ps*  ©to.,  in  these 
thermodynamic  equations  may  be  defined  as  differential  coeffi- 
cients by  either  of  the  equations  in  which  they  occur.* 

•  Compare  Transaction!  Connsoticmt  Academy,  Vol  m,  pages  116  ft 
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If  we  compare  the  statistical  equations  (529)  and  (532) 
with  (114)  and  (112),  which  are  given  in  Chapter  IV,  and 
discussed  in  Chapter  XIV,  as  analogues  of  thermody- 
namic equations,  we  find  considerable  difference.  Beside  the 
terms  corresponding  to  the  additional  terms  in  the  thermo- 
dynamic equations  of  this  chapter,  and  beside  the  fact  that 
the  averages  are  taken  in  a  grand  ensemble  in  one  case 
and  in  a  petit  in  the  other,  the  analogues  of  entropy,  H 
and  ?,  are  quite  different  in  definition  and  value.  We  shall 
return  to  this  point  after  we  have  determined  the  order 
of  magnitude  of  the  usual  anomalies  of  *v  ...  r* 

If  we  differentiate  equation  (518)  with  respect  to  jh,  and 
multiply  by  8,  we  get 

whence  dQ/dfj^  =  —  vv  which  agrees  with  (527).    Differen- 
tiating again  with  respect  to  /ij,  and  to  /*,,  and  setting 


da  _      -      dO  _      - 
d^  —  ~    SR""^ 


we  get 


o+iv*— +#w-* 


*~M&*iFh^s— * 


(587) 


o+wv  •+**»*-+ 


(638) 


The  first  members  of  these  equations  represent  the  average 
values  of  the  quantities  in  the  principal  parentheses.  We 
have  therefore 


<«J     ttrfn 


fr-^-jp-y.-.—  =  eg,  (539) 

5=5S=5-n*-«— •^-•^-•g.  (MO) 
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From  equation  (539)  we  may  get  an  idea  of  the  order  of 
magnitude  of  the  divergences  of  vx  from  its  average  value 
in  the  ensemble,  when  that  average  value  is  great.  The 
equation  may  be  written 

fe^2?.»*  (641) 

The  second  member  of  this  equation  will  in  general  be  small 
when  vt  is  great.  Large  values  are  not  necessarily  excluded, 
but  they  must  be  confined  within  very  small  limits  with  re- 
spect to  p.    For  if 

vx  nf 

for  all  values  of  ^  between  the  limits  /*/  and  14",  we  shall 
have  between  the  same  limits 


and  therefore 


*e(^"^)>M1""/H'-  (644) 

The  difference  /*/'  —  ptf  is  therefore  numerically  a  veiy  small 
quantity.  To  form  an  idea  of  the  importance  of  such  a 
difference,  we  should  observe  that  in  formula  (498)  /*i  is 
multiplied  by  vx  and  the  product  subtracted  from  the  energy. 
A  very  small  difference  in  the  value  of  /^  may  therefore  be  im- 
portant. But  since  vS  is  always  less  than  the  kinetic  energy 
of  the  system,  our  formula  shows  that  /*/'  —  /a/,  even  when 
multiplied  by  vx'  or  v^',  may  still  be  regarded  as  an  insensible 
quantity. 

We  can  now  perceive  the  leading  characteristics  with  re- 
spect to  properties  sensible  to  human  faculties  of  such  an  en- 
semble as  we  are  considering  (a  grand  ensemble  canonically 
distributed),  when  the  average  numbers  of  particles  of  the  vari- 
ous kinds  are  of  the  same  order  of  magnitude  as  the  number 
of  molecules  in  the  bodies  which  are  the  subject  of  physical 
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experiment.  Although  the  ensemble  contains  systems  having 
the  widest  possible  variations  in  respect  to  the  numbers  of 
the  particles  which  they  contain,  these  variations  are  practi- 
cally contained  within  such  narrow  limits  as  to  be  insensible, 
except  for  particular  values  of  the  constants  of  the  ensemble. 
This  exception  corresponds  precisely  to  the  case  of  nature, 
when  certain  thermodynamic  quantities  corresponding  to  6, 
fih  fir  etc.,  which  in  general  determine  the  separate  densities 
of  various  components  of  a  body,  have  certain  values  which 
make  these  densities  indeterminate,  in  other  words,  when  the 
conditions  are  such  as  determine  coexistent  phases  of  matter. 
Except  in  the  case  of  these  particular  values,  the  grand  en- 
semble would  not  differ  to  human  faculties  of  perception  from 
a  petit  ensemble,  viz.,  any  one  of  the  petit  ensembles  which  it 
contains  in  which  vu  y*  etc.,  do  not  sensibly  differ  from  their 
average  values. 

Let  us  now  compare  the  quantities  H  and  17,  the  average 
values  of  which  (in  a  grand  and  a  petit  ensemble  respectively) 
we  have  seen  to  correspond  to  entropy.    Since 

O  +  Min  > . .  +  j^r*  —  c 
H  = g > 

and  *  =  !fLiL!' 

H-?  =  n  +  *yi  —  +  '<»v»~*.  (646) 

A  part  of  this  difference  is  due  to  the  fact  that  H  relates  to 
generic  phases  and  i\  to  specific.  If  we  write  1;^  for  the 
index  of  probability  for  generic  phases  in  a  petit  ensemble, 
we  have 

i?t«  =  i?  +  loglvj*--b>  C646) 

H  -  v  =  H  -  Vna  +  log  [n  . . .  [v* ,  (647) 

H  —  ifce*  = -g log  l^. .  .  [v*.      (548) 

This  is  the  logarithm  of  the  probability  of  the  petit  en* 
semble  (vx . . .  vh) .*    If  we  set 

•  See  formula  (517). 
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^=J  =  W  (M9) 

which  corresponds  to  the  equation 

*?-*  (SK>) 

we  have  ^9m  =  ^  +  ©  log  (vi . . .  \v^f 

and  H-lp.»0  +  ftn"^+w"^-.  (651) 

This  will  have  a  maximum  when* 

Distinguishing  values  corresponding  to  this  maximum  by 
accents,  we  have  approximately,  when  vx , .  • .  vh  are  of  the 
same  order  of  magnitude  as  the  numbers  of  molecules  in  ordi- 
nary bodies, 

O  +  /W  *  -  «  +  /W  —  fas/ 

e 

_  /gVeVCA^       /rfV^VA^Av,  /tfy^YCAvQ* 

\^!/    2e      \dvrfvj     e    ••'    \dvk*J    2%  ' 

(553) 

e*-^9»  _  e<7e    V  rfrx*  /     2e      Vrf^rfr,/       e      ' "     Vrf^V  "25" 

'(554) 

where  (7  =  °  +  *^  : : : '  + »»'  =  »-',  (555) 

and  Af1  =  v1  —  iV,        Avi  =  h-Vi        etc.  (566) 

This  is  the  probability  of  the  system  (i^  . . .  i^).  The  prob- 
abilty  that  the  values  of  vt , . . .  vh  lie  within  given  limits  is 
given  by  the  multiple  integral 

*  Strictly  speaking,  f  it  not  determined  as  function  of  rlf . . .  rA,  except 
for  integral  Tallies  of  these  Tariables.  Yet  we  may  suppose  it  to  be  deter- 
mined as  a  continuous  function  by  any  suitable  process  of  interpolation. 
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SOS 


J      J  (667) 

This  shows  that  the  distribution  of  the  grand  ensemble  with 
respect  to  the  values  of  vv. . .  vk  follows  the  " law  of  errors " 
when  pt\  . . .  vj  are  very  great  The  value  of  this  integral 
for  the  limits  ±  oo  should  be  unity*    This  gives 


or 


(7  =  ilogi)-^log(2*e), 


(556) 
(N») 


when     D: 


ft**)' 


(560) 


\*<iV      \dndvt)  "■ 

V*>*j  v*.*J v*»* ; 

ffcY    f^Y  feY 

WJ     \*»J \d»J 

(dj^V        /djH\  (dj^V 

V<*n/       \dvt) \dvk) 

(d^y     fd^y  fdj^y 

\dnj         \~dyj VrfvJ 

Now,  by  (558),  we  have  for  the  first  approximation 

fl-^-  =  tf=*logl>-§log(2,r®),  (662) 

and  if  we  divide  by  the  constant  K,*  to  reduce  these  quanti- 
ties to  the  usual  unit  of  entropy, 


that  is,    D  = 


(561) 


fl-^  __  log  D  -  h  log  (2*6) 
•  See  page  184-186. 


(663) 
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This  is  evidently  a  negligible  quantity,  since  K  is  of  the  same 
order  of  magnitude  as  the  number  of  molecules  in  ordinary 
bodies.  It  is  to  be  observed  that  fj9m  is  here  the  average  in 
the  grand  ensemble,  whereas  the  quantity  which  we  wish  to 
compare  with  H  is  the  average  in  a  petit  ensemble.  But  as  we 
have  seen  that  in  the  case  considered  the  grand  ensemble  would 
appear  to  human  observation  as  a  petit  ensemble,  this  dis- 
tinction may  be  neglected. 

The  differences  therefore,  in  the  case  considered,  between  the 
quantities  which  may  be  represented  by  the  notations  * 

are  not  sensible  to  human  faculties.    The  difference 

^-U»  ■"  i7^Ut  =  la.-  •  •  fa> 

and  is  therefore  constant,  so  long  as  the  numbers  vly .  . .  vh 
are  constant.  For  constant  values  of  these  numbers,  therefore, 
it  is  immaterial  whether  we  use  the  average  of  f/wm  or  of  i\  for 
entropy,  since  this  only  affects  the  arbitrary  constant  of  in- 
tegration which  is  added  to  entropy.  But  when  the  numbers 
vv  .  .  .  vh  are  varied,  it  is  no  longer  possible  to  use  the  index 
for  specific  phases.  For  the  principle  that  the  entropy  of  any 
body  has  an  arbitrary  additive  constant  is  subject  to  limi- 
tation, when  different  quantities  of  the  same  substance  are 
concerned.  In  this  case,  the  constant  being  determined  for 
one  quantity  of  a  substance,  is  thereby  determined  for  all 
quantities  of  the  same  substance. 

To  fix  our  ideas,  let  us  suppose  that  we  have  two  identical 
fluid  masses  in  contiguous  chambers.  The  entropy  of  the 
whole  is  equal  to  the  sum  of  the  entropies  of  the  parts,  and 
double  that  of  one  part.  Suppose  a  valve  is  now  opened, 
making  a  communication  between  the  chambers.  We  do  not 
regard  this  as  making  any  change  in  the  entropy,  although 
the  masses  of  gas  or  liquid  diffuse  into  one  another,  and  al- 
though the  same  process  of  diffusion  would  increase  the 

*  In  this  paragraph,  for  greater  distinctness,  fagw^a  and  ^1  m  hare 
been  written  for  the  quantities  which  elsewhere  are  denoted  by  H  and  * 
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entropy,  if  the  masses  of  fluid  were  different.  It  is  evident, 
therefore,  that  it  is  equilibrium  with  respect  to  generic  phases, 
and  not  with  respect  to  specific,  with  which  we  have  to  do  in 
the  evaluation  of  entropy,  and  therefore,  that  we  must  use 
the  average  of  H  or  of  17^.,  and  not  that  of  17,  as  the  equiva- 
lent of  entropy,  except  in  the  thermodynamics  of  bodies  in 
which  the  number  of  molecules  of  the  various  kinds  is 
constant 
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PREFATORY  NOTE  TO  PART  TWO 

In  a  few  cases  slight  corrections  had  been  made  by  the  author 
in  his  own  copies  of  the  papers.  These  changes,  together  with  the 
correction  of  obvious  misprints  in  the  originals,  have  been  incor- 
porated in  the  present  edition  without  comment.  Where  for  the 
sake  of  clearness  it  has  seemed  desirable  to  insert  a  word  or  two 
in  a  footnote  or  in  the  text  itself,  the  addition  has  been  indicated 
by  enclosing  it  within  square  brackets  [  ],  a  sign  which  is  otherwise 
used  only  in  the  formulae. 
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[American  Journal  of  Mathematics,  vol.  il  pp.  49-64,  1879.] 

Formation  of  a  new  Indeterminate  Formula  of  Motion  by  the  Sub- 
stitution of  the  Variations  of  the  Components  of  Acceleration  for 
the  Variations  of  the  Coordinates  in  the  usual  Formula. 

The  laws  of  motion  are  frequently  expressed  by  an  equation  of  the 
form  2[(X-rna)te+(Y-my)fy+(Z-mS)Sz]=Q,  (1) 

in  which 

m  denotes  the  mass  of  a  particle  of  the  system  considered, 
x,  y,  z  its  rectangular  coordinates, 
£,  $,  2  the  second  differential  coefficients  of  the  coordinates  with 

respect  to  the  time, 
X,  Y,  Z  the  components  of  the  forces  acting  on  the  particle, 
&,  Sy,  Sz  any   arbitrary   variations  of  the  coordinates   which   are 
simultaneously  possible,  and 
2  a  summation  with  respect  to  all  the  particles  of  the  system. 
It  is  evident  that  we  may  substitute  for  Sx,  &y,  Sz  any  other 
expressions  which  are  capable  of  the  same  and  only  of  the  same  sets 
of  simultaneous  values. 

Now  if  the  nature  of  the  system  is  such  that  certain  functions 
A,  B,  etc  of  the  coordinates  must  be  constant,  or  given  functions  of 
the  time,  we  have 

*(£*+£*+£*)-* ' 

etc. 
These  are  the  equations  of  condition,  to  which  the  variations  in 
the  general  equation  of  motion  (1)  are  subject    But  if  il  is  constant 
or  a  determined  function  of  the  time,  the  same  must  be  true  of 

A  and  A.    Now 

A     ^fdA  .  tdA  .^dA,\ 
A-*\a%X+a^y+lTz*) 


and 


*-*<£*+%il+%')+B- 
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where  H  represents  terms  containing  only  the  second  dMbrsnrisl 
coefficients  of  A  with  respeet  to  the  coordinates,  sad  the  irst  differ- 
ential coefficients  of  the  coordinate*  with  respeet  to  the  toe.  There- 
fore, if  we  conceive  of  a  variation  affecting  the  accelerations  of  the 
particles  at  the  time  considered,  but  not  their  positions  or  velocities, 
we  have 

and,  in  like  manner, 

etc 

Comparing  these  equations  with  (2),  we  see  that  when  the  ooesfora- 
tions  of  the  particles  are  regarded  as  subject  to  the  variation  denoted 
by  S,  but  not  their  positions  or  velocities,  the  possible  values  of  4&,  &j), 
Si  are  subject  to  precisely  the  same  restrictions  as  the  values  of  tm, 
Sy,  Sz,  when  the  poritions  of  the  particles  are  regarded  as  variable. 
We  may,  therefore,  write  for  the  general  equation  of  motion 

2[(X-rna)8£+(Y-my)Sil+(Z-rni)S*]-Q,  (4) 

regarding  the  positions  and  velocities  of  the  particles  as  unaffected  by 
the  variation  denoted  by  S, — a  condition  which  may  be  expressed  by 
the  equations  &.^0         Sytm0f        fc«0,| 

<&«0,        <Sy«0,        &-0.J 

We  have  so  far  supposed  that  the  conditions  which  restrict  the 
possible  motions  of  the  systems  may  be  expressed  by  equation* 
between  the  coordinates  alone  or  the  coordinates  and  the  time.  To 
extend  the  formula  of  motion  to  cases  in  which  the  conditions  ai* 
expressed  by  the  characters  g  or  S,  we  may  write 

2[(X-ma)Sx^Y-my)8f+(Z-<mM)8*]£0.  (6) 

The  conditions  which  determine  the  possible  values  of  6&,  Sjf,  Si 
will  not,  in  such  cases,  be  entirely  similar  to  those  which  determine 
the  possible  values  of  Sx,  Sy,  Sz,  when  the  coordinates  are  regarded  as 
variable.  Nevertheless,  the  laws  of  motion  are  correctly  expressed 
by  the  formula  (6),  while  the  formula 

2[(X-mx)Sx+(Y-<m,y)Sy+{£-mi)Sz)£0t  (7) 

does  not,  as  naturally  interpreted,  give  so  complete  and  accurate  an 
expression  of  the  laws  of  motion. 

This  may  be  illustrated  by  a  simple  example. 

Let  it  be  required  to  find  the  acceleration  of  a  material  point, 
which,  at  a  given  instant,  is  moving  with  given  velocity  on  the 


Digitized  by  VjOOQIC 


ON  THE  FUNDAMENTAL  FORMULJB  OF  DYNAMICS.  3 

frictionless  surface  of  *  body  (which  it  cannot  penetrate,  but  which 
it  may  leave),  and  is  acted  on  by  given  forces.  For  simplicity,  we 
may  suppose  that  the  normal  to  the  surface,  drawn  outward  from  the 
moving  point  at  the  moment  considered,  is  parallel  to  the  axis  of  X 
and  in  the  positive  direction.  The  only  restriction  on  the  values  of 
&,fy,**isthat  &>0. 

Formula  (7)  will  therefore  give 

-s*  f.l.  ,.i. 

The  condition  that  the  point  shall  not  penetrate  the  body  gives 
another  condition  for  the  value  of  &  If  the  point  remains  upon  the 
surface,  x  must  have  a  certain  value  N,  determined  by  the  form  of 
the  surface  and  the  velocity  of  the  point  If  the  value  of  *  is  less 
than  this,  the  point  must  penetrate  the  body.    Therefore, 

But  this  does  not  suffice  to  determine  the  acceleration  of  the  point 

Let  us  now  apply  formula  (6)  to  the  same  problem.    Since  #  cannot 
UlemtimnN,  .f  ^^    ^Q 

This  is  the  only  restriction  on  the  value  of  S&t  for  ii  x>N,  the 

value  of  S&  is  entirely  arbitrary.    Formula  (6),  therefore,  requires 

that  Y 

ifa»#    *>-; 
m 

— that  is  (since  #  cannot  be  less  than  N)t  that  #  shall  be  equal  to  the 
greater  o: 
and  that 


X 
greater  of  the  quantities  N  and  — ,  or  to  both,  if  they  are  equal, — 


Y  Z 

9    m  m 

The  values  of  x,  y,  %  are  therefore  entirely  determined  by  this 
formula  in  connection  with  the  conditions  afforded  by  the  constraints 
of  the  system* 

The  following  considerations  will  show  that  what  is  true  in  this 
case  is  also  true  in  general,  when  the  conditions  to  which  the  system 

•The  failure  of  the  formula  (7)  in  this  oaae  ii  rather  apparent  than  real;   for, 
although  the  formula  apparently  allows  to  xt  at  the  instant  oonddered,  a  value 

exceeding  both  JVand  — ,  it  does  not  allow  this  for  any  interval,  however  short.    For 

if  i<N,  the  point  will  immediately  leave  the  surfaoe,  and  then  the  formula  requires 

that*** 
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is  subject  are  such  that  certain  functions  of  the  coordinates  cannot 
exceed  certain  limits,  either  constant  or  variable  with  the  time.  If 
certain  values  of  Sx,  S$,  S2  (with  unvaried  values  of  x,  y,  zt  and  x,  y,  z) 
are  simultaneously  possible  at  a  given  instant,  equal  or  proportional 
values  with  the  same  signs  must  be  possible  for  8x,  Sy,  Sz  immediately 
after  the  instant  considered,  and  must  satisfy  formula  (1),  and  there- 
fore (6),  in  connection  with  the  values  of  £,  i),  2,  X,  Y,  Z  immediately 
after  that  instant  The  values  of  £,  $,  8,  thus  determined,  are  of 
course  the  very  quantities  which  we  wish  to  obtain,  since  the  accelera- 
tion of  a  point  at  a  given  instant  does  not  denote  anything  different 
from  its  acceleration  immediately  after  that  instant. 

For  an  example  of  a  somewhat  different  class  of  cases,  we  may 
suppose  that  in  a  system,  otherwise  free,  x  cannot  have  a  negative 
value.  Such  a  condition  does  not  seem  to  affect  the  possible  values 
of  Sx,  as  naturally  interpreted  in  a  dynamical  problem.  Yet,  if  we 
should  regard  the  value  of  Sx  in  (7)  as  arbitrary,  we  should  obtain 

m 

which  might  be  erroneous.  But  if  we  regard  Sx  as  expressing  a 
velocity  of  which  the  system,  if  at  rest,  would  be  capable  (which  is 
not  a  natural  signification  of  the  expression),  we  should  have  Sx^O, 
which,  with  (7),  gives  _ 

m 

This  is  not  incorrect,  but  it  leaves  the  acceleration  undetermined. 
If  we  should  regard  Sx  as  denoting  such  a  variation  of  the  velocity 
as  is  possible  for  the  system  when  it  has  its  given  velocity  (this  also 
is  not  a  natural  signification  of  the  expression),  formula  (7)  would 
give  the  correct  value  of  &  except  when  <c=0.  In  this  case  (which 
cannot  be  regarded  as  exceptional  in  a  problem  of  this  kind),  we 
should  have  Sx^O,  which  will  leave  #  undetermined,  as  before. 

The  application  of  formula  (6),  in  problems  of  this  kind,  presents 
no  difficulty.    From  the  condition 

we  obtain,  first,  if  x — 0,    x  ^  0, 

then,  ifi;=0    and    #=0,     Ac^O, 

which  is  the  only  limitation  on  the  value  of  Sx.  With  this  condition, 
we  deduce  from  (6)  that  either 

««0,  £«0,  and  £>-; 
m 

or  ai=— . 

m 
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That  is,  if  x«0,  x  has  the  greater  of  the  values  —  and  0 ;  otherwise, 

m 

In  cases  of  this  kind  also,  in  which  the  function  which  cannot 
exceed  a  certain  value  involves  the  velocities  (with  or  without  the 
coordinates),  one  may  easily  convince  himself  that  formula  (6)  is 
always  valid,  and  always  sufficient  to  determine  the  accelerations 
with  the  aid  of  the  conditions  afforded  by  the  constraints  of  the 
system. 

But  instead  of  examining  such  cases  in  detail,  we  shall  proceed  to 
consider  the  subject  from  a  more  general  point  of  view. 

Comparison  of  the  New  Formula  with  the  Statical  Principle  of 
Virtual  Velocities. — Case  of  Discontiguous  Changes  of  Velocity. 

Formula  (1)  has  so  far  served  as  a  point  of  departure.  The  general 
validity  of  this,  the  received  form  of  the  indeterminate  equation  of 
motion,  being  assumed,  it  has  been  shown  that  formula  (6)  will  be 
valid  and  sufficient,  even  in  cases  in  which  both  (1)  and  (7)  fail.  We 
now  proceed  to  show  that  the  statical  principle  of  virtual  velocities, 
when  its  real  signification  is  carefully  considered,  leads  directly  to 
formula  (6),  or  to  an  analogous  formula  for  the  determination  of  the 
discontinuous  changes  of  velocity,  when  such  occur.  This  will  be  (he 
case  even  if  we  start  with  the  usual  analytical  expression  of  the 
principle  2(Z&+F^y+Z&)^0,  (8) 

to  which,  at  first  sight,  formula  (6)  appears  less  closely  related  than 
(7).  For  the  variations  of  the  coordinates  in  this  formula  must  be 
regarded  as  relating  to  differences  between  the  configuration  which 
the  system  has  at  a  certain  time,  and  which  it  will  continue  to  have 
in  case  of  equilibrium,  and  some  other  configuration  which  the  system 
might  be  supposed  to  have  at  some  subsequent  time.  These  temporal 
relations  are  not  indicated  explicitly  in  the  notation,  and  should  not 
be,  since  the  statical  problem  does  not  involve  the  time  in  any 
quantitative  manner.  But  in  a  dynamical  problem,  in  which  we  take 
account  of  the  time,  it  is  hardly  natural  to  use  Sx,  $y,  Sz  in  the  same 
sense.  In  any  problem  in  which  a?,  y,  z  are  regarded  as  functions 
of  the  time,  Sx>  Syt  Sz  are  naturally  understood  to  relate  to  differences 
between  the  configuration  which  the  system  has  at  a  certain  time, 
and  some  other  configuration  which  it  might  (conceivably)  have  had 
at  that  time  instead  o/that  which  it  actually  had. 

Now  when  we  suppose  a  point  to  have  a  certain  position,  specified 
by  x,  yt  z,  at  a  certain  time,  its  position  at  that  times  is  no  longer  a 
subject  of  hypothesis  or  of  question.    It  is  its  future  positions  which 
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form  the  subject  of  inquiry.  Its  position  in  the  immediate  future 
is  naturally  specified  by 

*+a><ft+$#(ft2+etc.,  y+yeft+i^+etc.,  «+*<&+ jpdP+efe, 
and  we  may  regard  the  variations  of  these  expressions  as  corre- 
sponding to  the  Sx,  8y,  Sz  of  the  statical  problem.  It  is  evidently 
sufficient  to  take  account  of  the  first  term  of  these  expressions  of 
which  the  variation  is  not  zero.  Now,  xt  y,  z,  as  has  already  been 
said,  are  to  be  regarded  as  constant.  With  respect  to  the  terms 
containing  x,  y,  £,  two  cases  are  to  be  distinguished,  according  as 
there  is,  or  is  not,  a  finite  change  of  velocity  at  the  instant  considered. 

Let  us  first  consider  the  most  important  case,  in  which  there  is  no 
discontinuous  change  of  velocity.  In  this  case,  x,  y,  &  are  not  to  be 
regarded  as  variable  (by  S),  and  the  variations  of  the  above  ex- 
pressions are  represented  by 

$<&(#,    ±Sydt\    \8idt\ 

which  are,  therefore,  to  be  substituted  for  Sx,  Sy,  Sz  in  the  general 
formula  of  equilibrium  (8)  to  adapt  it  to  the  conditions  of  a  dynamical 
problem.  By  this  substitution  (in  which  the  common  factor  \dt% 
may  of  course  be  omitted),  and  the  addition  of  the  terms  expressing 
the  reaction  against  acceleration,  we  obtain  formula  (6). 

But  if  the  circumstances  are  such  that  there  is  (or  may  be)  a 
discontinuity  in  the  values  of  x,  y,  z  at  the  instant  considered,  it  is 
necessary  to  distinguish  the  values  of  these  expressions  before  and 
.after  the  abrupt  change.  For  this  purpose,  we  may  apply  x,  y,  z 
to  the  original  values,  and  denote  the  changed  values  by  <r+ Aflc, 
y+Ay,  z+Az.  The  value  of  a?  at  a  time  very  shortly  subsequent 
to  the  instant  considered,  will  be  expressed  by  a:+(»-f  A«)ctt-f  etc,  in 
which  we  may  regard  Ax  as  subject  to  the  variation  denoted  by  & 
The  variation  of  the  expression  is  therefore  SAxdt.  Instead  of  —  mx, 
which  expresses  the  reaction  against  acceleration,  we  need  in  the 
present  case  —  mAx  to  express  the  reaction  against  the  abrupt  change 
of  velocity.  A  reaction  against  such  a  change  of  velocity  is,  of 
course,  to  be  regarded  as  infinite  in  intensity  in  comparison  with 
reactions  due  to  acceleration,  and  ordinary  forces  (such  as  cause 
acceleration)  may  be  neglected  in  comparison.  If,  however,  we 
conceive  of  the  system  as  acted  on  by  impulsive  forces  (i.e.,  such 
as  have  no  finite  duration,  but  are  capable  of  producing  finite  changes 
of  velocity,  and  are  measured  numerically  by  the  discontinuities  of 
velocity  which  they  produce  in  the  unit  of  mass),  these  forces  should 
be  combined  with  the  reactions  due  to  the  discontinuities  of  velocity 
in  the  general  formula  which  determines  these  discontinuities.  If 
the  impulsive  forces  are  specified  by  X,  Y,  Z,  the  formula  will  be 

[(X-mAs)  8Ax+(Y-mAy)  SAy +(Z-mA£)  SAz)]  ^  0.       (9) 
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The  mte  wil  lemark  the  strict  analogy  between  Ok  formula 

sad  (dX  which  would  perhaps  be  more  dearly  exhibited  if  we  should 

dm  d*  di 
write  -jt,  *,  tt  for  *,  i/,  £  in  that  formula. 

But  these  formula  may  be  established  in  a  mueh  more  direct 
manner..  For  the  formula  (8),  although  for  many  purposes  the  most 
convenient  expression  of  the  principle  of  virtual  velocities,  is  by  no 
means  the  most  convenient  for  our  present  purpose.  As  the  usual 
name  of  the  principle  implies,  it  holds  true  of  velocities  as  well 
as  of  displacements,  and  is  perhaps  more  simple  and  more  evident 
when  thus  applied.* 

If  .we  wish  to  apply  the  principle,  thus  understood,  to  a  moving 
system  so  as  to  determine  whether  certain  changes  of  velocity 
speciled  by  Aft,  Ay,  A*  are  those  which  the  system  will  really  receive 
at  a  given  instant,  the  velocities  to  be  multiplied  into  the  forces  and 
reactions  in  the  most  simple  application  of  the  principle  are  manifestly 
such  as  may  be  imagined  to  be  compounded  with  the  assumed 
velocities,  and  are  therefore  properly  specified  by  JAft,  JAy,  JA£. 
The  formula  (9)  may  therefore  be  regarded  as  the  most  direct  appli- 
cation of  the  principle  of  virtual  velocities  to  discontinuous  changes 
of  velocity  in  a  moving  system. 

In  the  eass  of  a  system  in  which  there  are  no  discontinuous  changes 
of  velocity,  but  which  is  subject  to  forces  tending  to  produce  accelera- 
tions, when  we  wish  to  determine  whether  certain  accelerations, 
specified  by  &,  #,  I,  are  such  as  the  system  will  really  receive,  it  is 
evidently  necessary  to  consider  whether  any  possible  variation  of 
these  accelerations  is  favored  more  than  it  is  opposed  by  the  forces 


•Iven  in  8tatios,  the  principle  of  virtual  vafoefties,  as  distinguished  from  that  of 
virtual  oftpfoctmetg*,  has  a  certain  advantage  in  respect  of  its  evidence.  The  demon- 
stration of  the  principle  in  the  first  seotion  of  the  M4osmiqu€  Analytique,  if  velocities 
had  been  considered  instead  of  displacement*,  would  not  have  been  exposed  to  en 
objection,  which  has  been  expressed  by  M.  Bertrand  in  the  following  words :  "  On  a 
object*,  aveo  raison,  a  oette  assertion  de  Lagrange  rexample  d'un  point  pesant  en 
eouilibre  au  sommet  le  plus  cTev*  d'une  oonrbe;  il  est  Evident  qu'un  deplaoement 
inimiment  petit  le  ferait  desoondre,  et,  pourtant,  ee  deplaoement  ne  se  produit  pas." 
(MdeatUqu*  Atudytiqet,  troiseme  edition,  tome  1,  page  23,  note  de  M.  Bertrand.)  The 
value  of  s  (the  height  of  the  point  above  a  horizontal  plane)  oan  osrtsinly  be  diminished 
by  a  displacement  of  the  point,  but  the  value  of  i  is  not  effected  by  any  velocity  given 
to  the  point. 

The  real  diftoulty  in  the  consideration  of  displacements  is  that  they  are  only  possible 
at  a  time  subsequent  to  that  in  which  the  system  has  the  configuration  to  which  the 
question  of  equilibrium  relates.  We  may  make  the  interval  of  time  infinitely  short, 
but  it  will  always  be  difficult,  in  the  establishing  of  fundamental  principles,  to  treat  a 
conception  of  this  kind  (relating  to  what*  is  possible  after  an  infinitesimal  interval  of 
time)  with  the  same  rigor  as  the  idea  of  velocities  or  accelerations,  which,  in  the  oases 
to  which  (9)  and  (S)  respectively  relate,  we  may  regard  as  oommunioated  immediately  to 
the  system. 
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and  reactions  of  the  system.  The  formula  (6)  expresses  a  criterion 
of  this  kind  in  the  most  simple  and  direct  manner.  If  we  regard 
a  force  as  a  tendency  to  increase  a  quantity  expressed  by  xy  the 
product  of  the  force  by  Sx  is  the  natural  measure  of  the  extent  to 
which  this  tendency  is  satisfied  by  an  arbitrary  variation  of  the 
accelerations.  The  principle  expressed  by  the  formula  may  not  be 
very  accurately  designated  by  the  words  virtual  velocities,  but  it 
certainly  does  not  differ  from  the  principle  of  virtual  velocities  (in 
the  stricter  sense  of  the  term),  more  than  this  differs  from  that  of 
virtual  displacements, — a  difference  so  slight  that  the  distinction  of 
the  names  is  rarely  insisted  upon,  and  that  it  is  often  very  difficult 
to  tell  which  form  of  the  principle  is  especially  intended,  even  when 
the  principle  is  enunciated  or  discussed  somewhat  at  length. 

But,  although  the  formulas  (6)  and  (9)  differ  so  little  from  the 
ordinary  formulas,  they  not  only  have  a  marked  advantage  in  respect 
of  precision  and  accuracy,  but  also  may  be  more  satisfactory  to  the 
mind,  in  that  the  changes  considered  (to  which  S  relates),  are  not  so 
violently  opposed  to  all  the  possibilities  of  the  case  as  are  those  which 
are  represented  by  the  variations  of  the  coordinates.*  Moreover,  as 
we  shall  see,  they  naturally  lead  to  various  important  laws  of  motion. 

Transformation  of  the  New  Formula. 

Let  us  now  consider  some  of  the  transformations  of  which  our 
general  formula  (6)  is  capable.    If  we  separate  the  terms  containing 

*  It  may  have  seemed  to  some  readers  of  the  MScanique  Analytique—*  work  of  whioh 
the  unity  of  method  is  one  of  the  most  striking  characteristics,  and  that  to  whioh  its 
universally  recognized  artistio  merit  is  in  great  measure  due— that  the  treatment  of 
dynamical  problems  in  that  work  is  not  entirely  analogous  to  the  treatment  of  statioal 
problems.  The  statioal  question,  whether  a  system  will  remain  in  equilibrium  in  a 
given  configuration,  is  determined  by  Lagrange  by  considering  all  possible  motions  of 
the  system  and  inquiring  whether  there  is  any  reason  why  the  system  should  take  any 
one  of  them.  A  similar  method  in  dynamics  would  be  based  upon  a  comparison  of  a 
proposed  motion  with  all  other  motions  of  whioh  the  system  is  capable  without  violating 
its  kinematioal  conditions.  Instead  of  this,  Lagrange  virtually  reduces  the  dynamical 
problem  to  a  statical  one,  and  considers,  not  the  possible  variations  of  the  proposed 
motion,  but  the  motions  whioh  would  be  possible  if  the  system  were  at  rest.  This 
reduction  of  a  given  problem  to  a  simpler  one,  whioh  has  already  been  solved,  is  a 
method  which  has  its  advantages,  but  it  is  not  the  characteristic  method  of  the 
M4caniqut  Analytiqxu.  That  whioh  most  distinguishes  the  plan  of  this  treatise  from 
the  usual  type  is  the  direot  application  of  the  general  principle  to  each  particular  case. 

The  point  is  perhaps  of  small  moment,  and  may  be  differently  regarded  by  others, 
but  it  is  mentioned  here  because  it  was  a  feeling  of  this  kind  (whether  justified  or  not) 
and  the  desire  to  express  the  formula  of  motion  by  means  of  a  maximum  or  minimum 
condition,  in  whioh  the  conditions  under  whioh  the  maximum  or  minimum  subsists 
should  be  suoh  as  the  problem  naturally  affords  (Oauss's  principle  of  /east  constraint 
being  at  the  time  unknown  to  the  present  writer,  and  the  conditions  under  whioh  the 
minimum  subsists  in  the  principle  of  least  action  being  suoh  that  that  is  hardly  satis- 
factory as  a  fundamental  principle),  whioh  led  to  the  formulas  proposed  in  this  paper. 
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the  masses  of  the  particles  from  those  which  contain  the  forces,  we 
k*™        2(X*g+  rty+£^-2[}mJ(#+ jf^*)]^,  (10) 

or,  if  we  write  u  for  the  acceleration  of  a  particle, 

2(X&e+  F<ty+Z<&)-<&2($mtt*)^0.  (11) 

If,  instead  of  terms  of  the  form  XSx,  or  in  addition  to  such  terms, 
equation  (1)  had  contained  terms  of  the  form  P8pf  in  which  p  denotes 
any  quantity  determined  by  the  configuration  of  the  system,  it  is 
evident  that  these  would  give  terms  of  the  form  P8p  in  (6),  (10)  and 
(11).  For  the  considerations  which  justified  the  substitution  of  8$, 
Sij,  82  for  Sxy  8y,  8z  in  the  usual  formula  were  in  no  respect  dependent 
upon  the  fact  that  x,  y,  z  denote  rectangular  coordinates,  but  would 
apply  equally  to  any  other  quantities  which  are  determined  by  the 
configuration  of  the  system. 

Hence,  if  the  moments  of  all  the  forces  of  the  system  are  represented 
by  the  sum  9(Pdp), 

the  general  formula  of  motion  may  be  written 

JKP<SpW2(*mttf)^0  (12) 

If  the  forces  admit  of  a  force-function  F,  we  have 

af-asumu^o, 

or  *[r-2(imtt*)]£0.  (13) 

But  if  the  forces  are  determined  in  any  way  whatever  by  the 
configuration  and  velocities  of  the  system,  with  or  without  the  time, 
X,  F,  Z  and  P  will  be  unaffected  by  the  variation  denoted  by  8,  and 
we  may  write  the  formula  of  motion  in  the  form 

^(Zs+F^+^-lmtt1)^,  (14) 

or  8[»(Pp)-2(frw>*)]£0.  (15) 

If  the  forces  are  determined  by  the  configuration  alone,  or  the 
configuration  and  the  time,  &T=0,  <SF=0,  8Z=*0t  SP=0,  and  the 
general  formula  may  be  written 

8[%2(Xx+Yy+Zz)-2(fau*)j^0,  (16) 

or  *[J*(i*)-S(Jmii«)]ga  (17) 

The  quantity  affected  by  8  in  any  one  of  the  last  five  formulae 
has  not  only  a  maximum  value,  but  absolutely  the  greatest  value 
consistent  with  the  constraints  of  the  system  This  may  be  shown 
in  reference  to  (15)  by  giving  to  p,  x,  i),  zt  contained  explicitly 
or  implicitly  in  the  expression  affected  by  8,  any  possible  finite 
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increments  p't  x\  f.  &,  and  subtracting  the  original  value  of  the 
expression  from  the  value  thus  modified.     Now, 

-»(Pp)+2[im(&+y*+z')] 

But  since  p,  &,  y'f  %  are  proportional  to  and  of  the  same  sign  with 
possible  values  of  $p,  S&}  Sy,  8z,  we  have,  by  the  general  formula  of 
motion,  *(Pp')-2[m(&o'+ifil'+M)]^0. 

The  second  member  of  the  preceding  equation  is  therefore  negative. 
The  first  member  is  therefore  negative,  which  proves  the  proposition 
with  respect  to  (15).  The  demonstration  is  precisely  the  same  with 
respect  to  (13)  and  (14),  which  may  be  regarded  as  particular 
cases  of  (15). 

To  show  the  same  with  regard  to  (16)  and  (17),  we  have  only  to 
observe  that  the  quantities  affected  by  S  in  these  formulae  differ  from 
those  affected  by  the  same  symbol  in  (14)  and  (15)  only  by  the  terms 

^tsb+Yy+Zi)    and    jKPp), 

which  will  not  be  affected  by  any  change  in  the  accelerations  of 
the  system. 

When  the  forces  are  determined  by  the  configuration  (with  or 
without  the  time),  the  principle  may  be  enunciated  as  follows:  The 
accelerations  in  the  system  are  always  such  that  the  acceleration  of 
the  rate  of  work  done  by  the  forces  diminished  by  one-half  the  sum 
of  the  products  of  the  masses  of  the  particles  by  the  squares  of  their 
accelerations  has  the  greatest  possible  value. 

The  formula  (17),  although  in  appearance  less  simple  than  (15),  not 
only  is  more  easily  enunciated  in  words,  but  has  the  advantage  that 

the  quantity  -^$t{Pp)  is  entirely  determined  by  the  system  with  its 

forces  and  motions,  which  is  not  the  case  with  jf(Pp).  The  value  of 
the  latter  expression  depends  upon  the  manner  in  which  we  choose  to 
represent  the  forces.  For  example,  if  a  material  point  is  revolving  in 
a  circle  under  the  influence  of  a  central  force,  we  may  write  either 
Xx+  Yy+Zz  or  Rft  for  Pp,  R  and  r  denoting  respectively  the  force 
and  radius  vector.    Now  Xx+Yy+Zz  is  manifestly  unequal  to  Rf. 

But  Xx+Yy+Zz  is  equal  to  Rrt  and  i(Xx+Yy+Zz)  is  equal  to 

It  may  not  be  without  interest  to  see  what  shape  our  general 
formulae  will  take  in  one  of  the  most  important  cases  of  forces 
dependent  upon  the  velocities.  If  a  body  which  can  be  treated  as  a 
point  is  moving  in  a  medium  which  presents  a  resistance  expressed  by 
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any  function  of  the  velocity,  the  terms  due  to  that  resistance  in  the 
general  formula  of  motion  may  be  expressed  in  the  form 

where  v  denotes  the  velocity  and  <f>(v)  the  resistance.    But 

v      v      v     at 
The  terms  due  to  the  resistance  reduce,  therefore,  to 

S[4>(v)v], 

or,  S^f(v), 

where  /denotes  the  primitive  of  the  function  denoted  by  0. 

Discontinuous  Changes  of  Velocity. — Formula  (9),  which  relates  to 
discontinuous  changes  of  velocity,  is  capable  of  similar  transformations. 
If  we  set  t^=Aa;t+Ay2+Ai2, 

the  formula  reduces  to 

tf2(XA*+YAy+ZA£--imw*)S0,  (18) 

where  X,  Y,  Z  are  to  be  regarded  as  constant.  If  jl(Pdp)  represents 
the  sum  of  the  moments  of  the  impulsive  forces,  and  we  regard  P  as 
constant,  we  have 

<WPA£)-2(lmw*)]^0.  (19) 

The  expressions  affected  by  S  in  these  formulae  have  a  greater  value 
than  they  would  receive  from  any  other  changes  of  velocity  consistent 
with  the  constraints  of  the  system. 

Deduction  of  other  Properties  of  Motion. 
The  principles  which  have  been  established  furnish  a  convenient 
point  of  departure  for  the  demonstration  of  various  properties  of 
motion  relating  to  maxima  and  minima.  We  may  obtain  several 
such  properties  by  considering  how  the  accelerations  of  a  system,  at  a 
given  instant,  will  be  modified  by  changes  of  the  forces  or  of  the 
constraints  to  which  the  system  is  subject.  Let  us  suppose  that  the 
forces  X,  F,  Z  of  a  system  receive  the  increments  X\  Y\  Zt,  in  con- 
sequence of  which,  and  of  certain  additional  constraints,  which  do  not 
produce  any  discontinuity  in  the  velocities,  the  components  of  accelera- 
tion x,  if,  z  receive  the  increments  x't  $',  2.    The  expression 

2[(Z+Z0(*+«/)+(F+F)(j/+y,)+(^+^,)(«+^ 

-4m{(*+dOH(jf+^+<*+*y}]  (20) 
will  be  the  greatest  possible  for  any  values  of  x,  y\  %  consistent  with 
the  constraints.    But  this  expression  may  be  divided  into  three  parts, 

2[(X+X^x+(Y+Y^gHZ+^-l™(&+y*+*)l       (21) 

l[Xx,+  Yy'+Zz'-m(xx'+yy'+zz')l  (22) 

and  2[X'x'+  Ttf+Z'z'-  im(£*+jp+**)l  (23) 
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The  first  part  is  evidently  constant  with  reference  to  variations  of 
&'>  V'>  3V*nd  may,  therefore,  be  neglected.  With  respect  to  the  second 
part  we  observe  that  by  the  general  formula  of  the  motion  we  have 

2,[X$x+Y8jj+ZS2-m(M£+ySij+28z)]=Q 
for  all  values  of  Sxt  8$,  Sis  which  are  possible  and  reversible  before  the 
addition  of  the  new  constraints.  But  values  proportional  to  x,  ij\  £', 
and  of  the  same  sign,  are  evidently  consistent  with  the  original  con- 
straints, and  when  the  components  of  acceleration  are  altered  to 
*+*',  V+y'>  %+£',  variations  of  these  quantities  proportional  to  and 
of  the  same  sign  as  —  x\  —  $',  —%  are  evidently  consistent  with  the 
original  constraints.  Now  if  these  latter  variations  were  not  possible 
before  the  accelerations  were  modified  by  the  addition  of  the  new  forces 
and  constraints,  it  must  be  that  some  constraint  was  then  operative 
which  afterwards  ceased  to  be  so.  The  expression  (22)  will,  therefore, 
be  equal  to  zero,  provided  only  that  all  the  constraints  which  were 
operative  before  the  addition  of  the  new  forces  and  constraints,  remain 
operative  afterwards.*  With  this  limitation,  therefore,  the  expression 
(23)  must  have  the  greatest  value  consistent  with  the  constraints. 
This  principle  may  be  expressed  without  reference  to  rectangular 
coordinates.  If  we  write  v!  for  the  relative  acceleration  due  to  the 
additional  forces  and  constraints,  we  have 

and  expression  (23)  reduces  to 

2(X'#+  Y'y'+Z'Z'-imu'*}.  (24) 

If  the  sum  of  the  moments  of  the  additional  forces  which  are 
considered  is  represented  by  &(Qdq)  (the  q  representing  quantities 
determined  by  the  configuration  of  the  system),  we  have 

l(X'x+ry+Zfi)=»(Qq). 

We  may  distinguish  the  values  of  -ri  immediately  before  and  imme- 
diately after  the  application  of  the  additional  forces  and  constraints 
by  the  expressions  q  and  q+q'.  With  this  understanding,  we  have, 
by  differentiation  of  the  preceding  equation, 

2[^/x+Fy+^+Z'(«+^)+F(^+f)+^(«+^/)] 

*  As  an  illustration  of  the  significance  of  this  limitation,  we  may  consider  the  con- 
dition afforded  by  the  impenetrability  of  two  bodies  in  oontaot.  Let  us  suppose  that  if 
subject  only  to  the  original  forces  and  constraints  they  would  continue  in  contact,  but 
that,  under  the  influenoe  of  the  additional*  forces  and  constraints,  the  oontaot  will  cease. 
The  impenetrability  of  the  bodies  then  oeases  to  be  operative  as  a  constraint.  Such 
oases  form  an  exoeption  to  the  prinoiple  whioh  is  to  be  established.  But  there  are  no 
exceptions  when  all  the  original  constraints  are  expressed  by  equations. 
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whence  it  appears  that  2,(X'tf+  Ytf+Z'*)  differs  from  JKG^O  only 
by  quantities  which  are  independent  of  the  relative  acceleration  due 
to  the  additional  forces  and  constraints.  It  follows  that  these  relative 
accelerations  are  such  as  to  make 

*(«r)-2(imu-«)  (25) 

a  maximum. 

It  will  be  observed  that  the  condition  which  determines  these 
relative  accelerations  is  of  precisely  the  same  form  as  that  which 
determines  absolute  accelerations. 

An  important  case  is  that  in  which  new  constraints  are  added  but 
no  new  forces.  The  relative  accelerations  are  determined  in  this  case 
by  the  condition  that  2  (Jmu'2)  is  a  minimum.  In  any  case  of  motion, 
in  which  finite  forces  do  not  act  at  points,  lines  or  surfaces,  we  may 
first  calculate  the  accelerations  which  would  be  produced  if  there  were 
no  constraints,  and  then  determine  the  relative  accelerations  due  to 
the  constraints  by  the  condition  that  2  (Jmu1)  is  a  minimum.  This 
is  Gauss's  principle  of  least  constraint* 

Again,  in  any  case  of  motion,  we  may  suppose  u  to  denote  the 
acceleration  which  would  be  produced  by  the  constraints  alone,  and  v! 
the  relative  acceleration  produced  by  the  forces ;  we  then  have 

whence,  if  we  write  u"  for  the  resultant  or  actual  acceleration, 

2(imu*)+2(lmu't)=2(imu//4). 
Moreover,  differentiating  (25),  we  obtain 

*(W)-"2[m(x'^/+y/^'+^r)]=0, 
whence,  since  8q't  $&,  6jj\  8&  may  have  values  proportional  to  q\  #, 

^  JK<#>22(imu'*). 

These  relations  are  similar  to  those  which  exist  with  respect  to 

vis  viva  and  impulsive  forces. 

Particular  Equations  of  Motion. 

From  the  general  formula  (12),  we  may  easily  obtain  particular 
equations  which  will  express  the  laws  of  motion  in  a  very  general 
form. 

Let  dc^,  dwv  etc  be  infinitesimals  (not  necessarily  complete 
differentials)  the  values  of  which  are  independent,  and  by  means 


*  This  principle  may  be  derived  very  directly  from  the  general  formal*  (6),  or  vice 
verm,  for  2  ( Jmu'*)  may  be  put  in  the  form 

>\.»{(-2f<>-zf*(-sm 

the  variation  of  whioh,  with  the  sign  ohanged,  is  identical  with  the  first  member  of  (6). 
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of  which  we  can  perfectly  define  any  infinitesimal  change  in  the 
configuration  of  the  system ;  and  let 


where  da^,  da^  are  to  be  determined  by  the  change  in  the' con- 
figuration in  the  interval  of  time  dt ;  and  let 


Sh  *■-&• 


etc 


Also  let  J7=2(Jmw2). 

It  is  evident  that  U  can  be  expressed  in  terms  of  c^,  6%,  etc, 
#p  tog,  etc,  and  the  quantities  which  express  the  configuration  of  the 
system,  and  that  (since  S  is  used  to  denote  a  variation  which  does 
not  affect  the  configuration  or  the  velocities), 

Moreover,  since  the  quantities  p  in  the  general  formula  are  entirely 
determined  by  the  configuration  of  the  system 


where  5*-  denotes  the  ratio  of  simultaneous  values  of  dp  and  da^, 
when  dw%,  etc  are  equal  to  zero,  and  -**-,  etc  are  to  be  interpreted 

on  the  same  principle.     Multiplying  by  P,  and  taking  the  sum  with 
respect  to  the  several  forces,  we  have 

JKPp^Q^+Q^+etc, 

If  we  differentiate  with  respect  to  ty  and  take  the  variation  denoted 
by  8,  we  obtain         ,(p^)=Qi&Ji+Q^+etc. 

The  general  formula  (12)  is  thus  reduced  to  the  form 

(^-3f)^+(°*-3|)^+etc-=0-  (26) 

If  the  forces  have  a  potential  V,  we  may  write 

dV 
where  -r—  denotes  the  ratio  of  dV  and  do^  when  da>t,  etc  have  the 

value  zero,  and  the  analogous  expressions  are  to  be  interpreted  on 
the  same  principle 
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If  the  variations  A^,  &*,,  etc.  are  capable  both  of  positive  and  of 
negative  values,  we  must  have 

g-Q.     g-lH*.  (28) 

dU    dV   dU   dV     .  /om 

arc?  eras? ete-  (29) 

To  illustrate  the  use  of  these  equations  in  a  case  in  which  cfoj,  cfa,, 
etc  are  not  exact  differentials,  we  may  apply  them  to  the  problem  of 
the  rotation  of  a  rigid  body  of  which  one  point  is  fixed.  If  do^,  cfo,, 
d&t  denote  infinitesimal  rotations  about  the  principal  axes  which  pass 
through  the  fixed  point,  I^,  Qj,  Qa  will  denote  the  moments  of  the 
impressed  forces  about  these  axes,  and  the  value  of  U  will  be  given 
by  the  formula 

2tf=(a+6+c)(^+(tf+#)l-(»f+ 

+(6+c)ffl?+(c+a)0j+(a+6)tfJ, 

where  a,  fr,  and  c  are  constants,  a+6,  6+c,  c-f  a  being  the  moments 
of  inertia  about  the  three  axes.    Hence, 

g_=s(6-c)aj«8+(6+c)dJI,    ^-=(c-a)«v»1-Kc4-a)^, 
3-=(a-6)a>l«l-f(a+6)^; 

and  the  equations  of  motion  are 

„  _(c-&)A**»+Qi 
*"' ^+5 ' 

(q-c)c&8»1+ni 
^  a+c 

<*- JTm ' 
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ON  THE  FUNDAMENTAL  FORMULA  OF  STATISTICAL 
MECHANICS,  WITH  APPLICATIONS  TO  ASTRONOMY 
AND  THERMODYNAMICS. 

[Proceedings  of  the  American  Association  for  the  Advancement  of  Science, 
vol.  xxxin.  pp.  57,  58,  1884.J 

(ABSTRACT.) 

Suppose  that  we  have  a  great  number  of  systems  which  consist 
of  material  points  and  are  identical. in  character,  but  different  in 
configuration  and  velocities,  and  in  which  the  forces  are  determined 
by  the  configuration  alone.  Let  the  number  of  systems  in  which  the 
coordinates  and  velocities  lie  severally  between  the  following  limits, 
viz.,  between 

<Bj  and  xx-\-dxXt 

yx  and  yx+dyx, 
zx  and   zx+dzxt 
$2  and  Xt+dxif 
etc, 

Xj  and  Xj-r-d^y 
yx  and  yx+dyv 
ix  and  ii+dil> 
x%  and  &,+<&£,, 
etc., 
be  denoted  by 

L  do&i  dyx  dzxdx%etc  dd^  dyx  dzx  dxt  etc. 

The  manner  in  which  the  quantity  L  varies  with  the  time  is  given 
by  the  equation 

dL        ^[dL.dLSi 

ar--sLa5*+ai*J' 

where  t,  a^,  ylh  zltxt,  etc,  afcj,  y,,  zx%  xt,  etc,  are  the  independent 
variables,  and  the  summation  relates  to  all  the  coordinates. 

The  object  of  the  paper  is  to  establish  this  proposition  (which  is  not 
claimed  as  new,  but  which  has  hardly  received  the  recognition  which 
it  deserves)  and  to  show  its  applications  to  astronomy  and  thermo- 
dynamics. 
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III. 

ELEMENTS  OF  VECTOR  ANALYSIS. 

[Privately  printed,  New  Haven,  pp.  17-50,  1881 ;  pp.  50-90,  1884.] 

(The  fundamental  principles  of  the  following  analysis  are  saoh  as  are  familiar  under 
a  slightly  different  form  to  students  of  quaternions.  The  manner  in  whioh  the  subjeot 
is  developed  is  somewhat  different  from  that  followed  in  treatises  on  quaternions,  since 
the  objeot  of  the  writer  does  not  require  any  use  of  the  oonoeption  of  the  quaternion, 
being  simply  to  give  a  suitable  notation  for  those  relations  between  vectors,  or  between 
vectors  and  scalars,  whioh  seem  most  important,  and  which  lend  themselves  most  readily 
to  analytical  transformations,  and  to  explain  some  of  these  transformations.  Aa-a 
precedent  for  such  a  departure  from  quaternionio  usage,  Clifford's  Kinematic  may  be 
cited  In  this  connection,  the  name  of  Grassmann  may  also  be  mentioned,  to  whose 
system  the  following  method  attaches  itself  in  some  respects  more  olosely  than  to  that 
of  Hamilton.) 

CHAPTBB  L 
CONCERNING  THB  ALGEBRA  OF  VECTORS. 

Fundamental  Notions. 

1.  Definition. — If  anything  has  magnitude  and  direction,  its  mag- 
nitude and  direction  taken  together  constitute  what  is  called  a  vector* 

The  numerical  description  of  a  vector  requires  three  numbers,  but 
nothing  prevents  us  from  using  a  single  letter  for  its  symbolical 
designation.  An  algebra  or  analytical  method  in  which  a  single  letter 
or  other  expression  is*  used  to  specify  a  vector  may  be  called  a  vector 
algebra  or  vector  amal/ym*. 

Def. — As  distinguished  from  vectors  the  real  (positive  or  negative) 
quantities  of  ordinary  algebra  are  called  ecalcure* 

As  it  is  convenient  that  the  form  of  the  letter  should  indicate 
whether  a  vector  or  a  scalar  is  denoted,  we  shall  use  the  small  Greek 
letters  to  denote  vectors,  and  the  small  English  letters  to  denote 
scalar*.  (The  three  letters,  t,  j,  k,  will  make  an  exception,  to  be 
mentioned  more  particularly  hereafter.  Moreover,  -w  will  be  used  in 
its  usual  scalar  sense,  to  denote  the  ratio  of  the  circumference  of 
a  circle  to  its  diameter.) 


*  The  imaginaries  of  ordinary  algebra  may  be  called  biecalar*,  and  that  whioh  cor- 
responds to  them  in  the  theory  of  vectors,  bivector*.  But  we  shall  have  no  occasion  to 
consider  either  of  these.    [See,  however,  footnote  on  p.  84.] 
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2.  Def. — Vectors  are  said  to  be  equal  when  they  are  the  same  both 
in  direction  and  in  magnitude.  This  equality  is  denoted  by  the 
ordinary  sign,  as  a=/3.  The  reader  will  observe  that  this  vector 
equation  is  the  equivalent  of  three  scalar  equations. 

A  vector  is  said  to  be  equal  to  aero,  when  its  magnitude  is  zero. 
Such  vectors  may  be  set  equal  to  one  another,  irrespectively  of  any 
considerations  relating  to  direction. 

3.  Perhaps  the  most  simple  example  of  a  vector  is  afforded  by  a 
directed  straight  line,  as  the  line  drawn  from  A  to  R  We  may  use 
the  notation  AB  to  denote  this  line  as  a  vector,  Le.,  to  denote  its 
length  and  direction  without  regard  to  its  position  in  other  respects. 
The  points  A  and  B  may  be  distinguished  as  the  origin  and  the 
terminus  of  the  vector.  Since  any  magnitude  may  be  represented  by 
a  length,  any  vector  may  be  represented  by  a  directed  line;  and  it 
will  often  be  convenient  to  use  language  relating  to  vectors,  which 
refers  to  them  as  thus  represented. 

Reversal  of  Direction,  Scalar  Multiplication  and  Division. 

4.  The  negative  sign  ( —  )  reverses  the  direction  of  a  vector.  (Some- 
times the  sign  +  may  be  used  to  call  attention  to  the  fact  that  the 
vector  has  not  the  negative  sign.) 

Def. — A  vector  is  said  to  be  multiplied  or  divided  by  a  scalar 
when  its  magnitude  is  multiplied  or  divided  by  the  numerical  value 
of  the  scalar  and  its  direction  is  either  unchanged  or  reversed 
according  as  the  scalar  is  positive  or  negative.  These  operations 
are  represented  by  the  same  methods  as  multiplication  and  division 
in  algebra,  and  are  to  be  regarded  as  substantially  identical  with 
them.  The  terms  scalar  multiplication  and  scalar  division  are  used 
to  denote  multiplication  and  division  by  scalars,  whether  the  quantify 
multiplied  or  divided  is  a  scalar  or  a  vector. 

5.  Def — A  unit  vector  is  a  vector  of  which  the  magnitude  is  unify. 
Any  vector  may  be  regarded  as  the  product  of  a  positive  scalar 

(the  magnitude  of  the  vector)  and  a  unit  vector. 

The  notation  a©  may  be  used  to  denote  the  magnitude  of  the 
vector  a. 

Addition  and  Subtraction  of  Vectors. 

6.  Def — The  sum  of  the  vectors  a, /8,  etc  (written  a+/8+etc)  is 
the  vector  found  by  the  following  process.  Assuming  any  point  A, 
we  determine  successively  the  points  B,  C,  etc,  so  that  AB=a, 
BC*/8,  etc  The  vector  drawn  from  A  to  the  last  point  thus  deter- 
mined is  the  sum  required.  This  is  sometimes  called  the  geometrical 
sum,  to  distinguish  it  from  an  algebraic  sum  or  an  arithmetical  sum. 
It  is  also  called  the  resultant,  and  a,  /8,  etc  are  called  the  components. 
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When  the  vectors  to  be  added  are  all  parallel  to  the  same  straight  line, 
geometrical  addition  reduces  to  algebraic ;  when  they  have  all  the  same 
direction,  geometrical  addition  like  algebraic  reduces  to  arithmetical. 

It  may  easily  be  shown  that  the  value  of  a  sum  is  not  affected 
by  changing  the  order  of  two  consecutive  terms,  and  therefore  that 
it  is  not  affected  by  any  change  in  the  order  of  the  terms.  Again,  it 
is  evident  from  the  definition  that  the  value  of  a  sum  is  not  altered 
by  uniting  any  of  its  terms  in  brackets,  as  a+[)8+y]+ete.,  which  is 
in  effect  to  substitute  the  sum  of  the  terms  enclosed  for  the  terms 
themselves  among  the  vectors  to  be  added.  In  other  words,  the 
commutative  and  associative  principles  of  arithmetical  and  algebraic 
addition  hold  true  of  geometrical  addition. 

7.  Def. — A  vector  is  said  to  be  subtracted  when  it  is  added  after 
reversal  of  direction.  This  is  indicated  by  the  use  of  the  sign  — 
instead  of  +. 

8.  It  is  easily  shown  that  the  distributive  principle  of  arithmetical 
and  algebraic  multiplication  applies  to  the  multiplication  of  sums  of 
vectors  by  scalars  or  sums  of  scalars,  Le., 

(m+n+etc)  [a + /8 4-  etc.] = ma +ita+ etc 

+m/8+*i/8+ete. 

-fete 

9.  Vector  Equations. — If  we  have  equations  between  sums  and 
differences  of  vectors,  we  may  transpose  terms  in  them,  multiply  or 
divide  by  any  scalar,  and  add  or  subtract  the  equations,  precisely  as 
in  the  case  of  the  equations  of  ordinary  algebra.  Hence,  if  we  have 
several  such  equations  containing  known  and  unknown  vectors,  the 
processes  of  elimination  and  reduction  by  which  the  unknown  vectors 
may  be  expressed  in  terms  of  the  known  are  precisely  the  same,  and 
subject  to  the  same  limitations,  as  if  the  letters  representing  vectors 
represented  scalars.  This  will  be  evident  if  we  consider  that  in  the 
multiplications  incident  to  elimination  in  the  supposed  scalar  equations 
the  multipliers  are  the  coefficients  of  the  unknown  quantities,  or 
functions  of  these  coefficients,  and  that  such  multiplications  may  be 
applied  to  the  vector  equations,  since  the  coefficients  are  scalars. 

10.  Linear  relation  of  fowr  vectors,  Coordinates. — If  a,  j8,  and  y 
are  any  given  vectors  not  parallel  to  the  same  plane,  any  other  vector 
p  may  be  expressed  in  the  form 

p*=aa+b/3+cy. 

If  a,  )8,  and  y  are  unit  vectors,  a,  6,  and  c  are  the  ordinary  scalar 
components  of  p  parallel  to  a,  j8,  and  y.  If  /o  — OP,  (a,  /3,  y  being 
unit  vectors),  a,  6,  and  c  are  the  cartesian  coordinates  of  the  point  P 
referred  to  axes  through  O  parallel  to  a,  ft,  and  y.  When  the  values 
of  these  scalars  are  given,  p  is  said  to  be  given  in  terms  of  a,  j8,  and  y. 
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It  is  generally  in  this  way  that  the  value  of  a  vector  is  specified,  vis., 
in  terms  of  three  known  vectors.  For  such  purposes  of  reference,  a 
system  of  three  mutually  perpendicular  vectors  has  certain  evident 
advantages. 

11.  Normal  systems  of  unit  vectors. — The  letters  itj9  k  are  appro- 
priated to  the  designation  of  a  normal  system  of  unit  vectors,  i.e., 
three  unit  vectors,  each  of  which  is  at  right  angles  to  the  other  two 
and  determined  in  direction  by  them  in  a  perfectly  definite  manner. 
We  shall  always  suppose  that  &  is  on  the  side  of  the  i-j  plane  on 
which  a  rotation  from  %  to  j  (through  one  right  angle)  appears 
counter-clockwise.  In  other  words,  the  directions  of  i9  j,  and  k  are 
to  be  so  determined  that  if  they  be  turned  (remaining  rigidly  con- 
nected with  each  other)  so  that.i  points  to  the  east,  andj  to  the  north, 
k  will  point  upward.  When  rectangular  axes  of  X,  T,  and  Z  are 
employed,  their  directions  will  be  conformed  to  a  similar  condition, 
and  i,j,  k  (when  the  contrary  is  not  stated)  will  be  supposed  parallel 
to  these  axes  respectively.  We  may  have  occasion  to  use  more  than 
one  such  system  of  unit  vectors,  just  as  we  may  use  more  than  one 
system  of  coordinate  axes.  In  such  cases,  the  different  systems  may 
be  distinguished  by  accents  or  otherwise. 

12.  Nvmerical  computation  of  a  geometrical  sum. — If 

p=aa+bft+cy, 
<r  =  a' a «+■ Vft «+■ c'y , 
etc, 
then 

p+r+efa=(a+a'+efa)a+(b+b'+efa)ft+(c+c'+ete.)yt 

Le.,  the  coefficients  by  which  a  geometrical  sum  is  expressed  in  terms 
of  three  vectors  are  the  sums  of  the  coefficients  by  which  the  separate 
terms  of  the  geometrical  sum  are  expressed  in  terms  of  the  same 
three  vectors. 

Direct  and  Skew  Products  of  Vectors. 

13.  Def — The  direct  product  of  a  and  ft  (written  a.  ft)  is  the  scalar 
quantity  obtained  by  multiplying  the  product  of  their  magnitudes 
by  the  cosine  of  the  angle  made  by  their  directions. 

14.  Def — The  skew  product  of  a  and  ft  (written  ax/3)  is  a  vector 
function  of  a  and  ft.  Its  magnitude  is  obtained  by  multiplying  the 
product  of  the  magnitudes  of  a  and  ft  by  the  sine  of  the  angle  made 
by  their  directions.  Its  direction  is  at  right  angles  to  a  and  ft,  and 
on  that  side  of  the  plane  containing  a  and  ft  (supposed  drawn  from 
a  common  origin)  on  which  a  rotation  from  a  to  ft  through  an  arc 
of  less  than  180°  appears  counter-clockwise. 

The  direction  of  ax  ft  may  also  be  defined  as  that  in  which  an 
ordinary  screw  advances  as  it  turns  so  as  to  carry  a  toward  ft. 
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Again,  if  a  be  directed  toward  the  east,  and  ft  lie  in  the  same 
horizontal  plane  and  on  the  north  side  of  a,  axft  will  be  directed 
upward. 

15.  It  is  evident  from  the  preceding  definitions  that 

a.ft*=ft.a,    and     aX/8=  —  ftxcu 

16.  Moreover,  [na].j8-a.[njS]«n[a.#], 
and  [na]x0=ax[n0]=w[ax#|. 

The  brackets  may  therefore  be  omitted  in  such  expressions. 

17.  From  the  definitions  of  No.  11  it  appears  that 

i.i=j.j=k.k=l, 
i.j=j.i=i.k=k.i=j.k=k.j=0, 
ixi=0,        jxj=0,       &x*=0, 
ixj=k,       jxk=*i,       kxi=j, 
jxi*=—k,    &xj=— i,    ix*=— j. 

18.  If  we  resolve  ft  into  two  components  ft!  and  ft?,  of  which  the 
first  is  parallel  and  the  second  perpendicular  to  a,  we  shall  have 

a.ft=a.f¥  and  aX0=aXj8*. 

19.  a.\fi+y]=a.ft  +  a.y  and  ax[j8+y]=aX0  +  aXy. 

To  prove  this,  let  <r=/8+y,  and  resolve  each  of  the  vectors  ft,  y,  <r 
into  two  components,  one  parallel  and  the  other  perpendicular  to  a. 
Let  these  be  ftt,  ft?,  y',  y",  </,  <r".  Then  the  equations  to  be  proved 
will  reduce  by  the  last  section  to 

a.<r'=a.f¥+a.y  and  aX<rr=aXj8'r+aXy'r. 
Now  since  <r*=ft+y  we  may  form  a  triangle  in  space,  the  sides  of 
which  shall  be  ft,  y,  and  <r.  Projecting  this  on  a  plane  perpendicular 
to  a,  we  obtain  a  triangle  having  the  sides  ft?,  y",  and  er",  which 
affords  the  relation  o/'=:ft?+y".  If  we  pass  planes  perpendicular 
to  a  through  the  vertices  of  the  first  triangle,  they  will  give  on  a 
line  parallel  to  a  segments  equal  to  f¥,  y,  </.  Thus  we  obtain  the 
relation  </=f¥+y'.-  Therefore  a.o-'=a.f¥+a.y,  since  all  the  cosines 
involved  in  these  products  are  equal  to  unity.  Moreover,  if  a  is  a 
unit  vector,  we  shall  evidently  have  ax</r=aXj8*+axy',  since  the 
effect  of  the  skew  multiplication  by  a  upon  vectors  in  a  plane 
perpendicular  to  a  is  simply  to  rotate  them  all  90°  in  that  plane. 
But  any  case  may  be  reduced  to  this  by  dividing  both  sides  of  the 
equation  to  be  proved  by  the  magnitude  of  a.  The  propositions 
are  therefore  proved. 

20.  Hence, 

[a+ft]-y=a.y+ft.y,        [«+j8]xy=aXy+j8xy, 
[a+ft].[y+8]=a.y+a.8+ft.y+ft.8, 
[a+ft]*[y+Sl  =  aXy+ax8+ftxy+ftx8; 
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and,  in  general,  direct  and  skew  products  of  sums  of.  vectors  may  be 
expanded  precisely  as  the  products  of  sums  in  algebra,  except  that 
in  skew  products  the  order  of  the  factors  must  not  be  changed 
without  compensation  in  the  sign  of  the  term.  If  any  of  the  terms 
in  the  factors  have  negative  signs,  the  signs  of  the  expanded  product 
(when  there  is  no  change  in  the  order  of  the  factors)  will  be  deter- 
mined by  the  same  rules  as  in  algebra.  It  is  on  account  of  this 
analogy  with  algebraic  products  that  these  functions  of  vectors  are 
called  products  and  that  other  terms  relating  to  multiplication  are 
applied  to  them. 

21.  Numerical  calculation  of  direct  and  skew  products. — The 
properties  demonstrated  in  the  last  two  paragraphs  (which  may  be 
briefly  expressed  by  saying  that  the  operations  of  direct  and  skew 
multiplication  are  distributive)  afford  the  rule  for  the  numerical 
calculation  of  a  direct  product,  or  of  the  components  of  a  skew 
product,  when  the  rectangular  components  of  the  factors  are  .given 
numerically*    In  fact  if 

a«aji+30+«t,    and    /8=aft+y'j+«'&; 
a.£=aw/+yy/+s*', 

and  ax/8=(y/-^i+(^/-a5lOi+(ajy/-yaJ')fc 

22.  Representation  of  the  area  of  a  parallelogram  by  a  skew 
product.->-It  will  be  easily  seen  that  ax£  represents  in  magnitude 
the  area  of  the  parallelogram  of  which  a  and  j8  (supposed  drawn 
from  a  common  origin)  are  the  sides,  and  that  it  represents  in  direc- 
tion the  normal  to  the  plane  of  the  parallelogram  on  the  side  on 
which  the  rotation  from  a  toward  /3  appears  counter-clockwise. 

23.  Representation  of  the  volume  of  a  parallelepiped  by  a  triple 
product — It  will  also  be  seen  that  ax£.y*  represents  in  numerical 
value  the  volume  of  the  parallelopiped  of  which  a,  /8,  and  y  (supposed 
drawn  from  a  common  origin)  are  the  edges,  and  that  the  value  of 
the  expression  is  positive  or  negative  according  as  y  lies  on  the  side 
of  the  plane  of  a  and  ft  on  which  the  rotation  from  a  to  /8  appears 
counter-clockwise,  or  on  the  opposite  side. 

24.  Hence, 

aXj8.y=£xy.a=yXa./8=y.aXj8=a./8xy 

=0.yxa=  -)8xa.y=  -yX0.a=  -aXy.0 
=  —  y.£xa=  —  a.yX)8=  —  jS.aXy. 

It  will  be  observed  that,  all  the  products  of  this  type,  which  can  be 
made  with  three  given  vectors,  are  the  same  in  numerical  value,  and 


*  Sinoe  the  sign  x  is  only  used  between  veotors,  the  skew  multiplication  in  expressions 
of  this  kind  is  evidently  to  be  performed  first.  In  other  words,  the  above  expression 
must  be  interpreted  as  [ax/9]  .7. 
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that  any  two  such  products  are  of  the*  same  or  opposite  character  in 
respect  to  sign,  according  as  the  cyclic  order  of  the  letters  is  the  same 
or  different  The  product  vanishes  when  two  of  the  vectors  are 
parallel  to  the  same  line,  or  when  the  three  are  parallel  to  the  same 
plane. 

This  kind  of  product  may  be  called  the  scalar  product  of  the  three 
vectors.  There  are  two  other  kinds  of  products  of  three  vectors,  both 
of  which  are  vectors,  viz.,  products  of  the  type  (a./3)y  or  y(a.j8),  and 
products  of  the  type  ax[j8xy]  or  [yx/J]xa. 

25.  i.jxk~j.kxi**k.i*j=l.        i.kxj=k.jxi=j.ixk—  —  1. 

From  these  equations,  which  follow  immediately  from  those  of 
No.  17,  the  propositions  of  the  last  section  might  have  been  derived, 
viz.,  by  substituting  for  a,  ft,  and  y,  respectively,  expressions  of  the 
form  xi+yj+zk,  x'i+y'j+z'k,  and  x'i+y'j+z"k*  Such  a  method, 
which  may  be  called  expansion  in  terms  of.  i,j,  and  Jfe,  will  on  many 
occasions  afford  very  simple,  although  perhaps  lengthy,  demonstrations. 

26.  Triple  products  containing  only  two  different  letters. — The 
significance  and  the  relations  of  (a. a) ft,  (a. ft) a,  and  ax[ax#|  will 
be  most  evident,  if  we  consider  j8  as  made  up  of  two  components, 
fX  and  {T,  respectively  parallel  and  perpendicular  to  a.    Then 

(a.p)a=(a.P)a~(a.a)P, 
ax[aX/8]  =  ax[aX/8*]«  -(<*.a)j8". 
Hence,  ax[ax#|»(a.j8)a-(a.a)j8. 

27.  Oeneral  relation  of  the  vector  products  of  three  factors. — In  the 
triple  product  ax(j8xy]  we  may  set 

a  =  J0+my+n/8xy, 
unless  p  and  y  have  the  same  direction.    Then 
ax[/8xy]=^x[/8xy]+myxC8xy] 

-{lp.y+™y.y)p-{lp.p+rriy.P)y. 
But  J)8.y+my.y=*a.y,  and  l/3.fi+my.fi**a.fi. 

Therefore  ax[£xy]=(«.y)£-(a.j8)y, 

which  is  evidently  true,  when  p  and  y  have  the  same  directions.  It 
may  also  be  written 

[yX/3]xa=0(y.a)-yO8.a). 


*  The  student  who  it  familiar  with  the  nature  of  determinants  will  not  fail  to  observe 
that  the  triple  product  a./Sx?  is  the  determinant  formed  by  the  nine  rectangular 
components  of  a,  Bt  and  y,  nor  that  the  rectangular  components  of  ax/3  are  determinants 
of  the  second  order  formed  from  the  components  of  a  and  0.  (See  the  last  equation  of 
No.  21.) 
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28.  This  principle  may  be  used  in  the  transformation  of  more 
complex  products.  It  will  be  observed  that  its  application  will 
always  simultaneously  eliminate,  or  introduce,  two  signs  of  skew 
multiplication. 

The  student  will  easily  prove  the  following  identical  equations, 
which,  although  of  considerable  importance,  are  here  given  principally 
as  exercises  in  the  application  of  the  preceding  formulas. 

29.  ax[j8xy]+£x[yxa]+yx[axj8]  =  0. 

30.  [ax/8].[yxfl«(a.y)(i8^)-(a.*)08.y). 
81.                [ax£]x[yxfl=(a.yx<$)j8-08.yx£)a 

=(a.pxS)y-(a.pXy)S. 
32.  ax[j8x[yxfl]=(a.yX*)/8-(a.£)yx* 

=0MaXy-08.y)aX& 
38.      [aXi8].[yx5]x[«X^=(a./8x*)(y.«xf)-(a./3xy)(^«xD 
=(a./3x€)(f.yx^)-(a.i8xf)(€.yx5) 
=(y.*Xa)03.exf)-(y.*X/8)(a.exfr 

34.  [axfl.D8xyMyXaMa.0Xy)1. 

35.  The  student  will  also  easily  convince  himself  that  a  product 
formed  of  any  number  of  letters  (representing  vectors)  combined  in 
any  possible  way  by  scalar,  direct,  and  skew  multiplications  may  be 
reduced  by  the  principles  of  Nos.  24  and  27  to  a  sum  of  products, 
each  of  which  consists  of  scalar  factors  of  the  forms  a. ft  and  a.^Sxy, 
with  a  single  vector  factor  of  the  form  a  or  ax/8,  when  the  original 
product  is  a  vector. 

36.  Elimination  of  scalars  from  vector  equations. — It  has  already 
been  observed  that  the  elimination  of  vectors  from  equations  of  the 
form 

aa+fy8+cy+cW+eta=0 

is  performed  by  the  same  rule  as  the  eliminations  of  ordinary  algebra. 
(See  No.  9.)  But  the  elimination  of  scalars  from  such  equations  is  at 
least  formally  different  Since  a  single  vector  equation  is  the  equi- 
valent of  three  scalar  equations,  we  must  be  able  to  deduce  from  such 
an  equation  a  scalar  equation  from  which  two  of  the  scalars  which 
appear  in  the  original  vector  equation  have  been  eliminated.  We 
shall  see  how  this  may  be  done,  if  we  consider  the  scalar  equation 

(w.X+6£.X+cy.X+<W.X+etc  =  0, 

which  is  derived  from  the  above  vector  equation  by  direct  multipli- 
cation by  a  vector  X.  We  may  regard  the  original  equation  as  the 
equivalent  of  the  three  scalar  equations  obtained  by  substituting  for 
a,  j8,  y,  &,  etc,  their  X-,  Y-,  and  Z-components.  The  second  equation 
would  be  derived  from  these  by  multiplying  them  respectively  by 
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the  X-,  Y-,  and  Z-components  of  X  and  adding.  Hence  the  second 
equation  may  be  regarded  as  the  most  general  form  of  a  scalar  equa- 
tion of  the  first  degree  in  a,  b,  c,  d,  etc,  which  can  be  derived  from 
the  original  vector  equation  or  its  equivalent  three  scalar  equations. 
If  we  wish  to  have  two  of  the  scalars,  as  b  and  c,  disappear,  we  have 
only  to  choose  for  X  a  vector  perpendicular  to  0  and  y.  Such  a 
vector  is  £xy.    We  thus  obtain 

aa.j8xy+cW.j8xy+eta=0. 

37.  Relations  of  fowr  vectors. — By  this  method  of  elimination  we 
may  find  the  values  of  the  coefficients  a,  6,  and  c  in  the  equation 

p  =  aa+&£+cy,  (1) 

by  which  any  vector  p  is  expressed  in  terms  of  three  others.  (See 
No.  10.)    If  we  multiply  directly  by  0xy,  yXa,  and  aX/8,  we  obtain 

p.0xy=aa.j8xy,    p.yXa=60.yxa,    />.aX0=*cy.aXj8;       (2) 
whence 

a-^g**,    6-*££.    *-fc£l  (3) 

a.£xy'  a./8xy'  a.£xy  v  ' 

By  substitution  of  these  values,  we  obtain  the  identical  equation, 

(a.jSxy)  />=(/>.£xy)  a  +  (p.yXa)  0  +  (p.aX/3)  y.  (4) 

(Compare  No.  31.)  If  we  wish  the  four  vectors  to  appear  symmetri- 
cally in  the  equation  we  may  write 

(a.£xy)/>  -  08.yx/>)a  +  (y.pXa)  j8  -  (p.aX/8)y=0.         (5) 
If  we  wish  to  express  p  as  a  sum  of  vectors  having  directions 
perpendicular  to  the  planes  of  a  and  0,  of  ft  and  y,  and  of  y  and  a, 
we  may  write 

/>=6£xy+/yXa+0ceX/8.  (6) 

To  obtain  the  values  of  e,  /,  g,  we  multiply  directly  by  a,  by  £,  and 
by  y.    This  gives 

P-0        ««- £2- 


*£.yXa'    ^y.aXjS'    '"a./Sxy'  (7) 

Substituting  these  values  we  obtain  the  identical  equation 

(a.j8xy)/>=0>.a)/3xy  +  (/>.£)  yXa  +  (p.y)ax£  (8) 

(Compare  No.  32.) 

38.  Reciprocal  systems  of  vectors, — The  results  of  the  preceding 
section  may  be  more  compactly  expressed  if  we  use  the  abbreviations 

,      fixy        ry      yXa  ,       ax£  ,.. 

a~^'  P-J&Ca'  yV«xl8-  (1) 

The  identical  equations  (4)  and  (8)  of  the  preceding  number  thus 
become 

p  =  0>.a/)a+(/>./3/))8+(p.y/)y,  (2) 

p=(p.a)a'+(p./8)j8>(p.y)y'.  (3) 
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We  may  infer  from  the  similarity  of  these  equations  that  the  relations 
of  a,  )8,  y,  and  a,  {¥,  y  are  reciprocal,  a  proposition  which  is  easily 
proved  directly.    For  the  equations 

a-ZFZ/>  p-t?h>  *"yf&  (4) 

are  satisfied  identically  by  the  substitution  of  the  values  of  a ,  f¥, 
and  y  given  in  equations  (1).    (See  Nos.  31  and  84t) 

Dfc/. — It  will  be  convenient  to  use  the  term  reciprocal  to  designate 
these  relations,  Le.,  we  shall  say  that  three  vectors  are  reciprocals  of 
three  others,  when  they  satisfy  relations  similar  to  those  expressed  in 
equations  (1)  or  (4). 

With  this  understanding  we  may  say : — 

The  coefficients  by  which  any  vector  is  expressed  in  terms  of 
three  other  vectors  are  the  direct  products  of  that  vector  with  the 
reciprocals  of  the  three. 

Among  other  relations  which  are  satisfied  by  reciprocal  systems 
of  vectors  are  the  following: 

a.a  =/3.f¥—y.y  -1, 
a.jS'-O,    a.y'~0,    £.a'«0,    £.y'«0,    y.a'-O,    y.^-0.      (5) 
These  nine  equations  may  be  regarded  as  defining  the  relations 
between  a,  0,  y  and  a',  f¥,  y  as  reciprocals, 

(a.pxy)(a'.pxy')-l.  (6) 

(See  No.  34.)  axa'+j8x/8'+yxy'«0.  (7) 

(See  No.  29.) 

A  system  of  three  mutually  perpendicular  unit  vectors  is  reciprocal 
to  itself,  and  only  such  a  system. 
The  identical  equation 

P=(p.i)i+(pj)j+(p.k)k  (8) 

may  be  regarded  as  a  particular  case  of  equation  (2). 
The  system  reciprocal  to  aXjS,  0xy,  yxa  is 
a'xfr    Pxy\    y'xa', 

°*  g  )8  y 

a.0xy'    a.fixy9    a.pxy' 

38a.  If  we  multiply  the  identical  equation  (8)  of  No.  37  by  o-xt, 
we  obtain  the  equation 

(a.)8xy)0>.<rXT)=a.p08.<ry.T— fi>ry.*) 

+/3.p(y.o- o.t — y.T  a.a-)+y.p(a.o- )8.t— a.T /8.<r), 

which  is  therefore  identical  But  this  equation  cannot  subsist  identi- 
cally, unless 

(o.  j8xy)crXT  =«  a(J3.<r  y.T— £.t  y.o-) 

+)8(y.<r  a.T— y.T  a.<r)+y(a.<r  j8.T— o.t  )8.<r) 
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is  also  an  identical  equation.  (The  reader  will  observe  that  in  each 
of  these  equations  the  second  member  may  be  expressed  as  a  deter- 
minant) 

From  these  transformations,  with  those  already  given,  it  follows 
that  a  product  formed  of  any  number  of  letters  (representing  vectors 
and  scalars),  combined  in  any  possible  .way  by  scalar,  direct,  and  skew 
multiplications,  may  be  reduced  to  a  sum  of  products,  containing 
each  the  sign  x  once  and  only  once,  when  the  original  product 
contains  it  an  odd  number  of  times,  or  entirely  free  from  the  sign, 
when  the  original  product  contains  it  an  even  number  of  times. 

89.  Scalar  equations  of  the  first  degree  with  respect  to  an  wnhnown 
vector. — It  is  easily  shown  that  any  scalar  equation  of  the  first 
degree  with  respect  to  an  unknown  vector  p,  in  which  all  the  other 
quantities  are  known,  may  be  reduced  to  the  form 

p.a**a, 
in  which  a  and  a  are  known.    (See  No.  85.)    Three  such  equations 
will  afford  the  value  of  p  (by  equation  (8)  of  No,  87,  or  equation  (3) 
of  No.  88),  which  may  be  used  to  eliminate  p  from  any  other  equation 
either  scalar  or  vector. 

When  we  have  four  scalar  equations  of  the  first  degree  with 
respect  to  p,  the  elimination  may  be  performed  most  symmetrically 
by  substituting  the  values  of  />.a,  etc,  in  the  equation 

(p.a)()8.yX<)-(/>.i8)(y^Xa)+(p.y)(5.aXi8)-0>.^(a.i8xy)-0, 
which   is   obtained   from   equation  (8)  of  No.  87  by  multiplying 
directly  by  S.    It  may  also  be  obtained  from  equation  (5)  of  No.  87 
by  writing  S  for  pt  and  then  multiplying  directly  by  p. 

40.  Solution  of  a  vector  equation  of  the  first  degree  with  respect  to 
the  unknown  vector. — It  is  now  easy  to  solve  an  equation  of  the 

form  *-a<A.p)+j8(M*)+y<"#*  (1) 

where  a,  /J,  y,  8,  X,  /a,  and  v  represent  known  vectors.  Multiplying 
directly  by  £xy,  by  yxo,  and  by  aXjS,  we  obtain 

£.yx3«08.yXa)(X./>),    y.aX*«(y.aXj8)G*./>X 

a.£x*-(a.£Xy)(v./>); 

or  a'.S^X.p,    jS'.jap./o,    y'.S^v.p, 

where  a',  {¥,  y  are  the  reciprocals  of  a,  j8,  y.  Substituting  these 
values  in  the  identical  equation 

P-V<X.*)+MW)+*'fr.pX 
in  which  X',  fi\  v  are  the  reciprocals  of  X,  pt  v  (see  No.  38),  we  have 

P=XVJ)+M'08^)HV(y^),  (2) 

which  is  the  solution  required. 
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It  results  from  the  principle  stated  in  No.  35,  that  any  vector 
equation  of  the  first  degree  with  respect  to  p  may  be  reduced  to  the 

form  6=a(\.p)+P(p.p)+y(v.p)+ap+€Xp. 

But  a/>=aX'(X./>)  t-ayu'0*./>)+ai/(v./>), 

and  eXp=€X\'(\.p)+€Xfi'(ji.p)+cXv'(v.p), 

where  X',  fi\  v'  represent,  as  before,  the  reciprocals  of  X,  /a,  v.    By 

substitution  of  these  valies  the  equation  is  reduced  to  the  form  of 

equation  (1),  which  may  therefore  be  regarded  as  the  most  general 

form  of  a  vector  equation  of  the  first  degree  with  respect  to  p. 

41.  Bdatlcms  between  two  normal  systems  of  unit  vectors. — If 
i,j,  k,  and  i',j',  V  are  two  normal  systems  of  unit  vectors,  we  have 

j'=(i.?)i+(j-j')j+(lcm  \  (1) 

and  <-(  W+<  i.j')j'+(i.V)K  ) 

i-0'.*X+(i^V+0'.iO^  I  (2) 

k-(kAW+(k.j')j'+(k.k')V.  J 
(See  equation  (8)  of  No.  38.) 

The  nine  coefficients  in  these  equations  are  evidently  the  cosines 
of  the  nine  angles  made  by  a  vector  of  one  system  with  a  vector  of 
the  other  system.  The  principal  relations  of  these  cosines  are  easily 
deduced.  By  direct  multiplication  of  each  of  the  preceding  equations 
with  itself,  we  obtain  six  equations  of  the  type 

(U7+0'.;7+(fc.i')'=i.  (3) 

By  direct  multiplication  of  equations  (1)  with  each  other,  and  of 
equations  (2)  with  each  other,  we  obtain  six  of  the  type 

(*.*0  (<•  jO+O'-O  O'-i'HM  (*./)-o.  (4) 

By  skew  multiplication  of  equations  (1)  with  each  other,  we  obtain 
three  of  the  type 

*'  =  {0'.»')(A:./)-(*.i')0'./)}*+{(*.*'')(*-/)-(i.t')(*/)}J 

+{<<.O0"/)-(m,W>>* 

Comparing  these  three  equations  with  the  original  three,  we  obtain 
nine  of  the  type 

i.V=(j.i')(k.j')-(k.i')(j.j').  (5) 

Finally,  if  wo  equate  the  scalar  product  of  the  three  right  hand 
members  of  (1)  with  that  of  the  three  left  hand  members,  we  obtain 

-(*-O(i-i0(<-^)-(M0(i.JO(<-O-(*.iOO'.<0(<-i0-i-      («) 

Equations  (1)  and  (2)  (if  the  expressions  in  the  parentheses  are 
supposed  replaced  by  numerical  values)  represent  the  linear  relations 
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which  subsist  between  one  vector  of  one  system  and  the  three  vectors 
of  the  other  system.  If  we  desire  to  express  the  similar  relations 
which  subsist  between  two  vectors  of  one  system  and  two  of  the  other, 
we  may  take  the  skew  products  of  equations  (1)  with  equations  (2), 
after  transposing  all  terms  in  the  latter.  This  will  afford  nine 
equations  of  the  type 

(i.?W-(i.V)j'=(k.i')3-U.i')k  (7) 

We  may  diyide  an  equation  by  an  indeterminate  direct  factor.    [MS.  note  by  author.] 


CHAPTER  II. 
CONCERNING  THE  DIFFERENTIAL  AND  INTEGRAL  CALCULUS  OF  VECTORS. 

42.  Differentials  of  vectors. — The  differential  of  a  vector  is  the 
geometrical  difference  of  two  values  of  that  vector  which  differ 
infinitely  little.  It  is  itself  a  vector,  and  may  make  any  angle  with 
the  vector  differentiated.  It  is  expressed  by  the  same  sign  (d)  as  the 
differentials  of  ordinary  analysis. 

With  reference  to  any  fixed  axes,  the  components  of  the  differential 
of  a  vector  are  manifestly  equal  to  the  differentials  of  the  components 
of  the  vector,  i.e.,  if  a,  ft,  and  y  are  fixed  unit  vectors,  and 

p=xa+yfi+zyt 

dp  =  dxa+dy  ft+dzy. 

43.  Differential  ofafwnction  of  several  variables. — The  differential 
of  a  vector  or  scalar  function  of  any  number  of  vector  or  scalar 
variables  is  evidently  the  sum  (geometrical  or  algebraic,  according  as 
the  function  is  vector  or  scalar)  of  the  differentials  of  the  function 
due  to  the  separate  variation  of  the  several  variables. 

44.  Differential  of  a  product. — The  differential  of  a  product  of  any 
kind  due  to  the  variation  of  a  single  factor  is  obtained  by  prefixing 
the  sign  of  differentiation  to  that  factor  in  the  product  This  is 
evidently  true  of  differentials,  since  it  will  hold  true  even  of  finite 
differences. 

45.  From    these    principles    we    obtain    the    following   identical 


equations : 


d(a+£)=da+d/3,  (1) 

d(na)=dn,a+nda,  (2) 

d(a.p)=da.p+a.dpt  (3) 

d[aXj8]=dax/8+ax<*A  (4) 

d(a.pxy)=da.pxy+a.dpxy+a.pxdy,  (5) 

d[(a.P)y]=(da.P)y+(a.dp)y+(«-P)dv-  (6) 
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46.  Differential  coefficient  with  respect  to  a  scalar. — The  quotient 
obtained  by  dividing  the  differential  of  a  vector  due  to  the  variation 
of  any  scalar  of  which  it  is  a  function  by  the  differential  of  that 
scalar  is  called  the  differential  coefficient  of  the  vector  with  respect 
to  the  scalar,  and  is  indicated  in  the  same  manner  as  the  differential 
coefficients  of  ordinary  analysis. 

If  we  suppose  the  quantities  occurring  in  the  six  equations  of  the 

last  section  to  be  functions  of  a  scalar  t,  we  may  substitute  -%  f or  d  in 

those  equations  since  this  is  only  to  divide  all  terms  by  the  scalar  dt. 

47.  Successive  differentiations. — The  differential  coefficient  of  a 
vector  with  respect  to  a  scalar  is  of  course  a  finite  vector,  of  which 
we  may  take  the  differential,  or  the  differential  coefficient  with 
respect  to  the  same  or  any  other  scalar.  We  thus  obtain  differential 
coefficients  of  the  higher  orders,  which  are  indicated  as  in  the  scalar 
calculus. 

A  few  examples  will  serve  for  illustration. 

If  p  is  the  vector  drawn  from  a  fixed  origin  to  a  moving  point  at 

any  time  tt  -£  will  be  the  vector  representing  the  velocity  of  the 

point,  and  -^  the  vector  representing  its  acceleration. 

If  p  is  the  vector  drawn  from  a  fixed  origin  to  any  point  on  a 
curve,  and  s  the  distance  of  that  point  measured  on  the  curve  from 

any  fixed  point,  -^  is  a  unit  vector,  tangent  to  the  curve  and  having 

the  direction  in  which  s  increases;  -t4  is  a  vector  directed  from  a 

point  on  the  curve  to  the  center  of  curvature,  and  equal  to  the 

curvature ;  -£  x  -y4  is  the  normal  to  the  osculating  plane,  directed  to 

the  side  on  which  the  curve  appears  described  counter-clockwise 
about  the  center  of  curvature,  and  equal  to  the  curvature.  The 
tortuosity  (or  rate  of  rotation  of  the  osculating  plane,  considered  as 
positive  when  the  rotation  appears  counter-clockwise  as  seen  from 
the  direction  in  which  8  increases)  is  represented  by 

d*p   dtp 

48.  Integration  of  am,  equation  between  differentials. — If  t  and  u 
are  two  single- valued  continuous  scalar  functions  of  any  number  of 
scalar  or  vector  variables,  and 

dt*=du, 
then  £su+a, 

where  a  is  a  scalar  constant. 
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Or,  if  r  and  «  are  two  single- valued  continuous  vector  functions 
of  any  number  of  scalar  or  vector  variables,  and 

ctr»d», 
then  T««+a, 

where  a  is  a  vector  constant 

When  the  above  hypotheses  are  not  satisfied  in  general,  but  will  be 
satisfied  if  the  variations  of  the  independent  variables  are  confined 
within  certain  limits,  then  the  conclusions  will  hold  within  those  limits, 
provided  that  we  can  pass  by  continuous  variation  of  the  independent 
variables  from  any  values  within  the  limits  to  any  other  values 
within  them,  without  transgressing  the  limits. 

49.  So  far,  it  will  be  observed,  ail  operations  have  been  entirely 
analogous  to  those  of  the  ordinary  calculus. 

Functions  of  Position  in  Space. 

50.  Def — If  u  is  any  scalar  function  of  position  in  space  (Le.,  any 
scalar  quantity  having  continuously  varying  values  in  space),  Vu  is 
the  vector  function  of  position  in  space  which  has  everywhere  the 
direction  of  the  most  rapid  increase  of  u,  and  a  magnitude  equal  to 
the  rate  of  that  increase  per  unit  of  length.  Vu  may  be  called  the 
derivative  of  tt,  and  tt,  the  primitive  of  Vu. 

We  may  also  take  any  one  of  the  Nob.  51,  52,  58  for  the  definition 
of  Vu. 

51.  If  p  is  the  vector  defining  the  position  of  a  point  in  space, 

du**Vu.dp. 
rA  —       .du  ,  .dtt  ,  ,du 

w*  •  dm     .  —        du     .  —        du    .  _ 

•*•         sr*-v*  a&-*7*  n'kVn- 

54.  Def, — If  m  is  a  vector  having  continuously  varying  values  in  space, 

V.m  is  called  the  divergence  of  m  and  Vx«  its  ewi. 

U  we  set  ,-X»+Xj+», 

we  obtain  by  substitution  the  equations 

_       dX    dY    dZ 
V'm'd^+d^+H 
,  _         ./dZ    dY\,./dJ.    dZ\,,(dY    <*X\ 

which  may  also  be  regarded  as  defining  V.«*  and  Vx». 
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55.  Surface-integrals. — The  integral  ffu.dv,  in  which  d<r  represents 
an  element  of  some  surface,  is  called  the  surface-integral  of  w  for  that 
surface.  It  is  understood  here  and  elsewhere,  when  a  vector  is  said 
to  represent  a  plane  surface  (or  an  element  of  surface  which  may  be 
regarded  as  plane),  that  the  magnitude  of  the  vector  represents  the 
area  of  the  surface,  and  that  the  direction  of  the  vector  represents 
that  of  the  normal  drawn  toward  the  positive  side  of  the  surface. 
When  the  surface  is  denned  as  the  boundary  of  a  certain  space,  the 
outside  of  the  surface  is  regarded  as  positive. 

The  surface-integral  of  any  given  space  (i.e.,  the  surface-integral 
of  the  surf  ace  bounding  that  space)  is  evidently  equal  to  the  sum  of 
the  surface-integrals  of  all  the  parts  into  which  the  original  space 
may  be  divided  For  the  integrals  relating  to  the  surfaces  dividing 
the  parts  will  evidently  cancel  in  such  a  sum. 

The  surface-integral  of  w  for  a  closed  surface  bounding  a  space  dv 
infinitely  small  in  all  its  dimensions  is 

V.cadv. 

This  follows  immediately  from  the  definition  of  V»,  when  the  space 
is  a  parallelopiped  bounded  by  planes  perpendicular  to  i,  j,  k  In 
other  cases,  we  may  imagine  the  space — or  rather  a  space  nearly 
coincident  with  the  given  space  and  of  the  same  volume  dv — to  be 
divided  up  into  such  parallelopipeds.  The  surface-integral  for  the 
space  made  up  of  the  parallelopipeds  will  be  the  sum  of  the  surface- 
integrals  of  all  the  parallelopipeds,  and  will  therefore  be  expressed  by 
V.wdv.  The  surface-integral  of  the  original  space  will  have  sensibly 
the  same  value,  and  will  therefore  be  represented  by  the  same 
formula.  It  follows  that  the  value  of  V.»  does  not  depend  upon  the 
system  of  unit  vectors  employed  in  its  definition. 

It  is  possible  to  attribute  such  a  physical  signification  to  the 
quantities  concerned  in  the  above  proposition,  as  shall  make  it  evident 
almost  without  demonstration.  Let  us  suppose  w  to  represent  a  flux 
of  any  substance.  The  rate  of  decrease  of  the  density  of  that  sub- 
stance at  any  point  will  be  obtained  by  dividing  the  surface-integral 
of  the  flux  for  any  infinitely  small  closed  surface  about  the  point  by 
the  volume  enclosed.  This  quotient  must  therefore  be  independent  of 
the  form  of  the  surface.  We  may  define  V.w  as  representing  that 
quotient,  and  then  obtain  equation  (1)  of  No.  54  by  applying  the 
general  principle  to  the  case  of  the  rectangular  parallelopiped. 

56.  Skew  surface-integrals. — The  integral  ffdtrw  may  be  called 
the  skew  surface-integral  of  ».  It  is  evidently  a  vector.  For  a 
closed  surface  bounding  a  space  dv  infinitely  small  in  all  dimensions 
this  integral  reduces  to  Vxoodv,  as  is  easily  shown  by  reasoning  like 
that  of  No.  55. 
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57.  Integration. — If  dv  represents  an  element  of  any  space!  and  da- 
an  element  of  the  bounding  surface, 

fffV.wdv=fft*.d<r. 

For  the  first  member  of  this  equation  represents  the  sum  of  the 
surface-integrals  of  all  the  elements  of  the  given  space.  We  may 
regard  this  principle  as  affording  a  means  of  integration,  since  we 
may  use  it  to  reduce  a  triple  integral  (of  a  certain  form)  to  a  double 
integral 

The  principle  may  also  be  expressed  as  follows : 

The  surface-integral  of  any  vector  function  of  position  in  space  for 
a  closed  surface  is  equal  to  the  volume-integral  of  the  divergence  of 
that  function  for  the  space  enclosed. 

58.  IAme^vategraU. — The  integral  fw.  dp,  in  which  dp  denotes  the 
element  of  a  line,  is  called  the  line-vntegral  of  w  for  that  line.  It  is 
implied  that  one  of  the  directions  of  the  line  is  distinguished  as 
positive.  When  the  line  is  regarded  as  bounding  a  surface,  that  side 
of  the  surface  will  always  be  regarded  as  positive,  on  which  the 
surface  appears  to  be  circumscribed  counter-clockwise. 

59.  Integration. — From  No.  51  we  obtain  directly 

fVu.dp=*tt"— u', 

where  the  single  and  double  accents  distinguish  the  values  relating 
to  the  beginning  and  end  of  the  line. 

In  other  words, — The  line-integral  of  the  derivative  of  any  (con- 
tinuous and  single-valued)  scalar  function  of  position  in  space  is  equal 
to  the  difference  of  the  values  of  the  function  at  the  extremities  of 
the  line.    For  a  closed  line  the  integral  vanishes. 

60.  Integration. — The  following  principle  may  be  used  to  reduce 
double  integrals  of  a  certain  form  to  simple  integrals. 

If  do-  represents  an  element  of  any  surface,  and  dp  an  element  of 
the  bounding  line, 

Jjf'Vx^.da-^/w.dp. 

In  other  words, — The  line-integral  of  any  vector  function  of  position 
in  space  for  a  closed  line  is  equal  to  the  surface-integral  of  the  curl  of 
that  function  for  any  surface  bounded  by  the  line. 

To  prove  this  principle,  we  will  consider  the  variation  of  the 
line-integral  which  is  due  to  a  variation  in  the  closed  line  for  which 
the  integral  is  taken.    We  have,  in  the  first  place, 

Sjw.dp  =j8<*.dp  +jw.  S  dp. 
But  w.8dp=d(oo.$p)—dco.$p. 

Therefore,  since  fd(fit>.Sp)=Q  for  a  closed  line, 

8 f W.  dp  *=/&<*.dp  —fdu.Sp. 
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Now  fc-xGW-*!^*)]' 

and  *-2[^]-2g(i.^)], 

where  the  summation  relates  to  the  coordinate  axes  and  connected 
quantities.     Substituting  these   values   in  the  preceding  equation, 

or  by  No.  80, 

tf*-dp-fz[iXft\  •  [SPxdp]-fVxw.[Spxdp]. 

But  Spxdp  represents  an  element  of  the  surface  generated  by  the 
motion  of  the  element  dp,  and  the  last  member  of  the  equation  is 
the  surface-integral  of  Vx«  for  the  infinitesimal  surface  generated 
by  the  motion  of  the  whole  line.  Hence,  if  we  conceive  of  a  closed 
curve  passing  gradually  from  an  infinitesimal  loop  to  any  finite  form, 
the  differential  of  the  line-integral  of  *  for  that  curve  will  be  equal 
to  the  differential  of  the  surface  integral  of  Vx*  for  the  surface 
generated:  therefore,  since  both  integrals  commence  with  the  value 
lero,  they  must  always  be  equal  to  each  other.  Such  a  mode  of 
generation  will  evidently  apply  to  any  surface  closing  any  loop. 

61.  The  line-integral  of  m  for  a  closed  line  bounding  a  plane  surface 
cfcr  infinitely  small  in  ail  its  dimensions  is  therefore 

Vx».&r. 

This  principle  affords  a  definition  of  Vx«  which  is  independent 
of  any  reference  to  coordinate  axes.  If  we  imagine  a  circle  described 
about  a  fixed  point  to  vary  its  orientation  while  keeping  the  same 
sue,  there  will  be  a  certain  position  of  the  circle  for  which  the  line- 
integral  of  *  will  be  a  maximum,  unless  the  line-integral  vanishes  for 
all  positions  of  the  circle.  The  axis  of  the  circle  in  this  position, 
drawn  toward  the  side  on  which  a  positive  motion  in  the  circle 
appears  counter-clockwise,  gives  the  direction  of  Vx»,  and  the 
quotient  of  the  integral  divided  by  the  area  of  the  circle  gives  the 
magnitude  of  Vx* 

V,  V. ,  cmd  Vx  applied  to  Functions  of  Functions  of  Position. 

62.  A  constant  scalar  factor  after  V,  V.,  or  Vx  may  be  placed 
before  the  symbol 

68.  If  /(u)  denotes  any  scalar  function  of  ft,  and  f(u)  the  derived 
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64.  If  tt  or  *  is  a  function  of  several  scalar  or  vector  variables 
whkh  are  themselves  functions  of  the  position  of  a  single  point,  the 
value  of  V*  or  V.»  or  Vx#»  will  be  equal  to  the  sum  of  the  values 
obtained  by  making  successively  all  bat  each  one  of  these  variables 
constant. 

66.  By  the  use  of  this  principle  we  easily  derive  the  following 
identical  equations: 

V(*+«)«V*+Vik  (1) 

V.(t+»)=V.t+V.«.      Vx[t+«#]«Vxt+Vx«.  (2) 

V(*tt)««Vt+tVu.  (3) 

V.(u*)=»c#.Vu+uV.«.  (4) 

Vx[W)«t/Vx»-»xVu.  (5) 

V.[tx«]«c#.Vxt— r.Vx«.  (6) 

The  student  will  observe  an  analogy  between  these  equations  and 
the  formulae  of  multiplication.  (In  the  last  four  equations  the 
analogy  appears  most  distinctly  when  we  regard  all  the  factors  but 
one  as  constant.)  Some  of  the  more  curious  features  of  this  analogy 
are  due  to  the  fact  that  the  V  contains  implicitly  the  vectors  i,  j 
and  Jb,  which  are  to  be  multiplied  into  the  following  quantities. 

Combinations  of  the  Operators  V,  V.,  and  Vx. 

66.  If  u  is  any  scalar  function  of  position  in  space, 

VxV*«0, 

as  may  be  derived  directly  from  the  definitions  of  these  operators. 

67.  Conversely,  if  »  is  such  a  vector  function  of  position  in  space 

vxc#*»0, 

« is  the  derivative  of  a  scalar  function  of  position  in  space.    This  will 
appear  from  the  following  considerations : 

The  line-integraiy« .  dp  will  vanish  for  any  closed  line,  since  it  may 
be  expressed  as  the  surface-integral  of  Vx«.  (No.  60.)  The  line- 
integral  taken  from  one  given  point  F  to  another  given  point  F'  is 
independent  of  the  line  between  the  points  for  which  the  integral 
is  taken.  (For,  if  two  lines  joining  the  same  points  gave  different 
values,  by  reversing  one  we  should  obtain  a  closed  line  for  which  the 
integral  would  not  vanish.)  If  we  set  u  equal  to  this  line-integral, 
supposing  F'  to  be  variable  and  F  to  be  constant  in  position,  u  will 
be  a  scalar  function  of  the  position  of  the  point  P",  satisfying  the 
condition  du^w.dp,  or,  by  No.  51,  Vu««*  There  will  evidently  be 
an  infinite  number  of  functions  satisfying  this  condition,  which  will 
differ  from  one -another  by  constant  quantities. 


Digitized  by  VjOOQIC 


36  VECTOR  ANALYSIS. 

If  the  region  for  which  Vx«=0  is  unlimited,  these  functions  will 
be  single- valued  If  the  region  is  limited,  but  acyclic,*  the  functions 
will  still  be  single-valued  and  satisfy  the  condition  Vu=»  within  the 
same  region.  If  the  region  is  cyclic,  we  may  determine  functions 
satisfying  the  condition  Vu=a>  within  the  region,  but  they  will  not 
necessarily  be  single-valued. 

68.  If  «  is  any  vector  function  of  position  in  space,  V.Vx»»0. 
This  may  be  deduced  directly  from  the  definitions  of  No.  54 

The  converse  of  this  proposition  will  be  proved  hereafter. 

69.  If  u  is  any  scalar  function  of  position  in  space,  we  have  by 
Nos.  52  and  54 


*M£*£+S)* 


70.  Def. — If  «  is  any  vector  function  of  position  in  space,  we  may 
define  V.Vu  by  the  equation 


V.Vft): 


/d1      d1      cP\ 


the  expression  V.  V  being  regarded,  for  the  present  at  least,  as  a  single 
operator  when  applied  to  a  vector.  (It  will  be  remembered  that  no 
meaning  has  been  attributed  to  V  before  a  vector.)  It  should  be 
noticed  that  if 

w-iX+jY+ifeZ, 

V.  V«> = iV.VX  + jV.  VY + ifeV.  VZ, 

that  is,  the  operator  V.V  applied  to  a  vector  affects  separately  its 
scalar  components. 

71.  From  the  above  definition  with  those  of  Nos.  52  and  54  we 
may  easily  obtain 

V.V«=W.ft>-VxVx«>. 

The  effect  of  the  operator  V.V  is  therefore  independent  of  the  direc- 
tions of  the  axes  used  in  its  definition. 

72.  The  expression  ~£a*V.Va,  where  a  is  any  infinitesimal  scalar, 
evidently  represents  the  excess  of  the  value  of  the  scalar  function  u 


*If  every  oloeed  line  within  a  given  region  oan  contract  to  a  single  point  without 
breaking  its  oontinnity,  or  passing  out  of  the  region,  the  region  is  oalled  acydict  other- 
wisecyeftc 

A  cyolio  region  may  be  made  aoyolio  by  diaphragms,  whioh  must  then  be  regarded  as 
forming  part  of  the  surface  bounding  the  region,  eaoh  diaphragm  contributing  its  own 
area  twioe  to  that  surface.  This  process  may  be  used  to  reduce  many-valued  functions 
of  position  in  space,  having  single-valued  derivatives^  to  single-valued  funotions. 

When  funotions  are  mentioned  or  implied  in  the  notation,  the  reader  will  always 
understand  single-valued  functions,  unless  the  contrary  is  distinctly  intimated,  or  the 
case  is  one  in  whioh  the  distinction  is  obviously  immaterial.  Diaphragms  may  be 
applied  to  bring  funotions  naturally  many-valued  under  the  application  of  some  of  the 
following  theorems,  as  Nos.  74  ff. 
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at  the  point  considered  above  the  average  of  its  values  at  six  points 
at  the  following  vector  distances:  ai,  —  ai,  aj,  —  aj,  ok,  —ak.  Since 
the  directions  of  i,  j,  and  k  are  immaterial  (provided  that  they  are 
at  right  angles  to  each  other),  the  excess  of  the  value  of  u  at  the 
central  point  above  its  average  value  in  a  spherical  surface  of  radius 
a  constructed  about  that  point  as  the  center  will  be  represented  by 
the  same  expression,  —  JafV.Vu. 

Precisely  the  same  is  true  of  a  vector  function,  if  it  is  understood 
that  the  additions  and  subtractions  implied  in  the  terms  average  and 
excess  are  geometrical  additions  and  subtractions. 

Maxwell  has  called  —  V.Vu  the  concentration  of  u,  whether  u  is 
scalar  or  vector.  We  may  call  V.Vu  (or  V.V»),  which  is  proportioned 
to  the  excess  of  the  average  value  of  the  function  in  an  infinitesimal 
spherical  surface  above  the  value  at  the  center,  the  dispersion  of 
u  (or  «). 

Transforination  of  Definite  Integrals. 

73.  From  the  equations  of  No.  65,  with  the  principles  of  integration 
of  Nos.  57,  59,  and  60,  we  may  deduce  various  transformations  of 
definite  integrals,  which  are  entirely  analogous  to  those  known  in  the 
scalar  calculus  under  the  name  of  integration  by  parts.  The  follow- 
ing formulas  (like  those  of  Nos.  57,  59,  and  60)  are  written  for  the 
case  of  continuous  values  of  the  quantities  (scalar  and  vector)  to  which 
the  signs  V,  V.,  and  Vx  are  applied.  It  is  left  to  the  student  to 
complete  the  formulae  for  cases  of  discontinuity  in  these  values.  The 
manner  in  which  this  is  to  be  done  may  in  each  case  be  inferred  from 
the  nature  of  the  formula  itself.  The  most  important  discontinuities 
of  scalars  are  those  which  occur  at  surfaces :  in  the  case  of  vectors 
discontinuities  at  surfaces,  at  lines,  and  at  points,  should  be  considered. 

74.  From  equation  (3)  we  obtain 

J^ituydp^ru'^tu^/uVt.dp+ftVu.dp, 

where  the  accents  distinguish  the  quantities  relating  to  the  limits  of 
the  line-integrals.  We  are  thus  able  to  reduce  a  line-integral  of  the 
form  fuVt.  dp  to  the  form  —ftVu.dp  with  quantities  free  from  the 
sign  of  integration. 

75.  From  equation  (5)  we  obtain 

ffV%{vA»).d<T^fvA».dp^JfuVxta.d(T^Jfwyyu.dWt 

where,  as  elsewhere  in  these  equations,  the  line-integral  relates  to  the 
boundary  of  the  surface-integral. 

From  this,  by  substitution  of  Vt  for  a>,  we  may  derive  as  a 
particular  case 

ffVu%Vt.d<r=fuVt.dp=  -ftVu.dp. 
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76.  From  equation  (4)  we  obtain 

where,  as  elsewhere  in  these  equations,  the  surface-integral  relates  to 
the  boundary  of  the  volume-integrals. 

From  this,  by  substitution  of  Vt  for  »,  we  derive  as  a  particular  case 

which  is  Green's  Theorem.    The  substitution  of  sVt  for  «  gives  the 
more  general  form  of  this  theorem  which  is  due  to  Thomson,  vis., 

-Jft*V*.d*-ffftV\8Vv,]dv. 

77.  From  equation  (6)  we  obtain 

A  particular  case  is 

J%fVu.Vx*  dv  -ff*x  Vu.d<r. 

Integration  of  Differential  Equations. 

78.  If  throughout  any  continuous  space  (or  in  all  space) 

Vu-0, 
then  throughout  the  same  space 

u= constant 

79.  If  throughout  any  continuous  space  (or  in  all  space) 

V.Vu-0, 
and  in  any  finite  part  of  that  space,  or  in  any  finite  surface  in  or 
bounding  it, 

Vu=0, 

then  throughout  the  whole  space 

Vu«0,  and  u= constant 

This  will  appear  from  the  following  considerations : 
If  Vu«0  in  any  finite  part  of  the  space,  u  is  constant  in  that  part 
If  tt  is  not  constant  throughout,  let  us  imagine  a  sphere  situated 
principally  in  the  part  in  which  u  is  constant,  but  projecting  slightly 
into  a  part  in  which  u  has  a  greater  value,  or  else  into  a  part  in 
which  u  has  a  less.  The  surface-integral  of  Vu  for  the  part  of  the 
spherical  surface  in  the  region  where  u  is  constant  will  have  the 
value  zero :  for  the  other  part  of  the  surface,  the  integral  will  be 
either  greater  than  zero,  or  less  than  zero.  Therefore  the  whole 
surface-integral  for  the  spherical  surface  will  not  have  the  value  zero, 
which  is  required  by  the  general  condition,  V.Vu=0. 
Again,  if  Vu=0  only  in  a  surface  in  or  bounding  the  space  in 
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which  V.VutbO,  u  will  be  constant  in  this  surface,  and  the  surface 
will  be  contiguous  to  a  region  in  which  V.Vu«0  and  u  has  a  greater 
value  than  in  the  surface,  or  else  a  less  value  than  in  the  surface. 
Let  us  imagine  a  sphere  lying  principally  on  the  other  aide  of  the 
surface,  but  projecting  slightly  into  this  region,  and  let  us  particularly 
consider  the  surface-integral  of  Vu  for  the  small  segment  cut  off  by 
the  surface  Vu=0.  The  integral  for  that  part  of  the  surface  of  the 
segment  which  consists  of  part  of  the  surface  Vu«0  will  have  the 
value  zero,  the  integral  for  the  spherical  part  will  have  a  value  either 
greater  than  zero  or  else  less  than  zero.  Therefore  the  integral  for 
the  whole  surface  of  the  segment  cannot  have  the  value  zero,  which 
is  demanded  by  the  general  condition,  V.Vu«0. 

80.  If  throughout  a  certain  space  (which  need  not  be  continuous, 
and  which  may  extend  to  infinity) 

V.Vtt-Q, 

and  in  all  the  bounding  surfaces 

u  »  constant"*  a, 

and  (in  case  the  space  extends  to  infinity)  if  at  infinite  distances 
within  the  space  u—o, — then  throughout  the  space 
Vu*0,  and  u«a. 

For,  if  anywhere  in  the  interior  of  the  space  Vu  has  a  value 
different  from  zero,  we  may  find  a  point  P  where  such  is  the  case, 
and  where  u  has  a  value  6  different  from  a, — to  fix  our  ideas  we  will 
say  leas.  Imagine  a  surface  enclosing  all  of  the  space  in  which  u  <  6. 
(This  must  be  possible,  since  that  part  of  the  space  does  not  reach  to 
infinity.)  The  surface-integral  of  Vu  for  this  surface  has  the  value 
zero  in  virtue  of  the  general  condition  V.Vu«0.  But,  from  the 
manner  in  which  the  surface  is  defined,  no  part  of  the  integral  can  be 
negative.  Therefore  no  part  of  the  integral  can  be  positive,  and  the 
supposition  made  with  respect  to  the  point  P  is  untenable.  That  the 
supposition  that  bo  is  untenable  may  be  shown  in  a  similar 
manner.    Therefore  the  value  of  u  is  constant 

This  proposition  may  be  generalized  by  substituting  the  condition 
V.[*Vtt]=0  for  V.Vtt«0,  t  denoting  any  positive  (or  .any  negative) 
scalar  function  of  position  in  space.  The  conclusion  would  be  the 
same,  and  the  demonstration  similar. 

81.  If  throughout  a  certain  space  (which  need  not  be  continuous, 
and  which  may  extend  to  infinity) 

V.Vu-0, 
and  in  all  the  bounding  surfaces  the  normal  oompbnent  of  Vu  vanishes, 
and  at  infinite  distances  within  the  space  (if  such  there  are)  r1-*-  — 0, 
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where  r  denotes  the  distance  from  a  fixed  origin,  then  throughout  the 
space 

Vu=0, 

and  in  each  continuous  portion  of  the  same 

u= constant. 

For,  if  anywhere  in  the  space  in  question  Vu  has  a  value  different 
from  zero,  let  it  have  such  a  value  at  a  point  P,  and  let  u  be  there 
equal  to  b.  Imagine  a  spherical  surface  about  the  above-mentioned 
origin  as  center,  enclosing  the  point  P,  and  with  a  radius  r.  Con- 
sider that  portion  of  the  space  to  which  the  theorem  relates  which  is 
within  the  sphere  and  in  which  u<b.  The  surface  integral  of  Vu 
for  this  space  is  equal  to  zero  in  virtue  of  the  general  condition 
V.Vt*=0.  That  part  of  the  integral  (if  any)  which  relates  to  a 
portion  of  the  spherical  surface  has  a  value  numerically  not  greater 

than  4*^ -r-J ,  where  I  -j-j  denotes  the  greatest  numerical  value  of 

-T-  in  the  portion  of  the  spherical  surface  considered.    Hence,  the 

value  of  this  part  of  the  surface-integral  may  be  made  less  (numeri- 
cally) than  any  assignable  quantity  by  giving  to  r  a  sufficiently  great 
value.  Hence,  the  other  part  of  the  surface-integral  (viz.,  that  relating 
to  the  surface  in  which  u=6,  and  to  the  boundary  of  the  space  to 
which  the  theorem  relates)  may  be  given  a  value  differing  from  zero 
by  less  than  any  assignable  quantity.  But  no  part  of  the  integral 
relating  to  this  surface  can  be  negative.  Therefore  no  part  can  be 
positive,  and  the  supposition  relative  to  the  point  P  is  untenable. 
This  proposition  also  may  be  generalized  by  substituting  V.[tVu]  »0 

for  V.Vu=0,  and  fo^-O  for  ^=0. 

82.  If  throughout  any  continuous  space  (or  in  all  space) 

V*«Vu, 

then  throughout  the  same  space 

lau+const. 

The  truth  of  this  and  the  three  following  theorems  will  be  apparent  if 
we  consider  the  difference  *— u. 

83.  If  throughout  any  continuous  space  (or  in  all  space) 

V.V*«V.Vti, 

and  in  any  finite  part  of  that  space,  or  in  any  finite  surface  in  or 
bounding  it, 

V*=Vu, 

then  throughout  the  whole  space 

Vt=Vu,  and  *=»u+ const 
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84.  If  throughout  a  certain  space  (which  need  not  be  continuous, 
and  which  may  extend  to  infinity) 

V.V*«V.Vu, 
and  in  all  the  bounding  surfaces 

and  at  infinite  distances  within  the  space  (if  such  there  are) 
then  throughout  the  space 

85.  If  throughout  a  certain  space  (which  need  not  be  continuous, 
and  which  may  extend  to  infinity) 

V.Vt-V.Vu, 
and  in  all  the  bounding  surfaces  the  normal  components  of  Vt  and  Vu 
are  equal,  and  at  infinite  distances  within  the  space  (if  such  there  are) 

r*^-— -j-JaO,  where  r  denotes  the  distance  from  some  fixed  origin, 

— then  throughout  the  space 

Vt-Vu, 

and  in  each  continuous  part  of  which  the  space  consists 

t —us  constant. 

86.  If  throughout  any  continuous  space  (or  in  all  space) 

Vxt«Vx«  and  V.t— V.», 
and  in  any  finite  part  of  that  space,  or  in  any  finite  surface  in  or 
bounding  it, 

then  throughout  the  whole  space 

For,  since  Vx(t-«)=0,  we  may  set  Vi*=»t— «,  making  the  space 
acyclic  (if  necessary)  by  diaphragms.  Then  in  the  whole  space  u  is 
single-valued  and  V.Vu»0,  and  in  a  part  of  the  space,  or  in  a  surface 
in  or  bounding  it,  Vw=0.    Hence  throughout  the  space  Vu=t— »=0. 

87.  If  throughout  an  aperiphractic*  space  contained  within  finite 
boundaries  but  not  necessarily  continuous 

Vxt*=Vx«  and  V.t=V.», 

and  in  all  the  bounding  surfaces  the  tangential  components  of  r  and 
c#  are  equal,  then  throughout  the  space 

It  is  evidently  sufficient  to  prove  this  proposition  for  a  continuous 
space.     Setting  Vu=t— »,  we  have  V.Vu=0  for  the  whole  space, 

•  If  a  sptoe  encloses  within  itself  another  spaoe,  it  is  oalled  periphractic,  otherwise 
aperiphmetic 
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and  u= constant  for  its  boundary,  which  will  be  a  single  surface  for  a 
continuous  aperiphractic  space.    Hence  throughout  the  space 

88.  If  throughout  an  acyclic  space  contained  within  finite  boundaries 
but  not  necessarily  continuous 

Vxt»Vx«>  and  V.t=»V.«, 

and  in  all  the  bounding  surfaces  the  normal  components  of  r  and  » 
are  equal,  then  throughout  the  whole  space 

T  =  «. 

Setting  Vtt=T— «,  we  have  V.Vu=0  throughout  the  space,  and 
the  normal  component  of  Vu  at  the  boundary  equal  to  zero.  Hence 
throughout  the  whole  space  Vu=t— a>=0. 

89.  If  throughout  a  certain  space  (which  need  not  be  continuous, 
and  vhich  may  extend  to  infinity) 

V.Vt«V.V« 
and  in  all  the  bounding  surfaces 

T  =  «, 

and  at  infinite  distances  within  the  space  (if  such  there  are) 

then  throughout  the  whole  space 

This  will  be  apparent  if  we  consider  separately  each  of  the  scalar 
components  of  t  and  *. 

Mmvmwra  Values  of  the  Vdurne-integrcUjffuw.adv. 
{Thomson's  Theorems.) 

90.  Let  it  be  required  to  determine  for  a  certain  space  a  vector 
function  of  position  »  subject  to  certain  conditions  (to  be  specified 
hereafter),  so  that  the  volume-integral 

Jfjffuv.adv 
for  that  space  shall  have  a  minimum  value,  u  denoting  a  given 
posftivp  scalar  function  of  position. 

•.a.  In  the  first  place,  let  the  vector  «  be  subject  to  the  conditions 
that  V.«  is  given  within  the  space,  and  that  the  normal  component 
of  «  is  given  for  the  bounding  surface.  (This  component  must  of 
course  be  such  that  the  surface-integral  of  »  shall  be  equal  to  the 
volume-integral  fV.wdv.  If  the  space  is  not  continuous,  this  must 
be  true  of  each  continuous  portion  of  it.  See  No.  57.)  The  solution 
is  that  Vx(tu»)»=0,  or  more  generally,  that  the  line-integral  of  vm  for 
any  closed  curve  in  the  space  shall  vanish. 
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The  existence  of  the  minimum  requires  that 

Jfflu*.fadv=0, 

while  Sw  is  subject  to  the  limitation  that 

V.&»«=0, 

and  that  the  normal  component  of  8te  at  the  bounding  surface  vanishes. 
To  prove  that  the  line-integral  of  uw  vanishes  for  any  closed  curve 
within  the  space,  let  us  imagine  the  curve  to  be  surrounded  by  an 
infinitely  slender  tube  of  normal  section  dz,  which  may  be  either 
constant  or  variable.     We  may  satisfy  the  equation  V.<$««0  by 

making  &.-0  outride  of  the  tube,  and  &.<fe=*a^  within  it,  *» 

denoting  an  arbitrary  infinitesimal  constant,  p  the  position-vector,  and 
ds  an  element  of  the  length  of  the  tube  or  closed  curve.    We  have  then 

whence  fu*.dp*0.  Q.EJ). 

We  may  express  this  result  by  saying  that  u»  is  the  derivative  of  a 
single-valued  scalar  function  of  position  in  space.    (See  No.  67.) 

If  for  certain  parts  of  the  surface  the  normal  component  of  » is  not 
given  for  each  point,  but  only  the  surface-integral  of  »  for  each  such 
part,  then  the  above  reasoning  will  apply  not  only  to  closed  curves, 
but  also  to  curves  commencing  and  ending  in  such  a  part  of  the 
surface  The  primitive  of  vm  will  then  have  a  constant  value  in 
each  such  part 

If  the  space  extends  to  infinity  and  there  is  no  special  condition 
respecting  the  value  of  «  at  infinite  distances,  the  primitive  of  um 
will  have  a  constant  value  at  infinite  distances  within  the  space  or 
within  each  separate  continuous  part  of  it. 

If  we  except  those  cases  in  which  the  problem  has  no  definite 
meaning  because  the  data  are  such  that  the  integral  futa.ta  dv  must 
be  infinite,  it  is  evident  that  a  minimum  must  always  exist,  and  (on 
account  of  the  quadratic  form  of  the  integral)  that  it  is  unique.  That 
the  conditions  just  found  are  sufficient  to  insure  this  minimum,  is 
evident  from  the  consideration  that  any  allowable  values  of  8te  may 
be  made  up  of  such  values  as  we  have  supposed.  Therefore,  there 
will  be  one  and  only  one  vector  function  of  position  in  space  which 
satisfies  these  conditions  together  with  those  enumerated  at  the 
beginning  of  this  number. 

b.  In  the  second  place,  let  the  vector  «  be  subject  to  the  conditions 
that  Vxo>  is  given  throughout  the  space,  and  that  the  tangential 
component  of  « is  given  at  the  bounding  surface.    The  solution  is  that 

V.[u«]»0, 
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and,  if  the  space  is  periphractic,  that  the  surface-integral  of  uw 
vanishes  for  each  of  the  bounding  surfaces. 
The  existence  of  the  minimum  requires  that 

Jj(j/uw.8todv=09 
while  8te  is  subject  to  the  conditions  that 

Vx&»=0, 
and  that  the  tangential  component  of  Soo  in  the  bounding  surface 
vanishes.    In  virtue  of  these  conditions  we  may  set 

8»=V8qt 
where  Sq  is  an  arbitrary  infinitesimal  scalar  function  of  position,  sub- 
ject only  to  the  condition  that  it  is  constant  in  each  of  the  bounding 
surfaces.    (See  No.  67.)    By  substitution  of  this  value  we  obtain 

ffJu<*.V&qdv=Q, 
or  integrating  by  parts  (No.  76) 

J/uw.<^Sq-Jj(fV.[u<a]8qdv^0. 

Since  Sq  is  arbitrary  in  the  volume-integral,  we  have  throughout  the 
whole  space 

and  since  Sq  has  an  arbitrary  constant  value  in  each  of  the  bounding 
surfaces  (if  the  boundary  of  the  space  consists  of  separate  parts),  we 
have  for  each  such  part 

Jjfv,(o.d<rt=Q. 

Potentials,  Newtonians,  Laplacians. 

91.  Def. — If  vf  is  the  scalar  quantity  of  something  situated  at  a 
certain  point  />',  the  potential  of  u  for  any  point  p  is  a  scalar  function 
of  p,  defined  by  the  equation 

and  the  Newtonian  of  v!  for  any  point  p  is  a  vector  function  of  p 
defined  by  the  equation 


newu  i 


P~P 


Again,  if  w  is  the  vector  representing  the  quantity  and  direction  of 
something  situated  at  the  point  p\  the  potential  and  the  Laplacian  of 
»'  for  any  point  p  are  vector  functions  of  p  defined  by  the  equations 


lap»  =p4 — £r  Xw. 
IP  -PR 
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92.  If  »  or  «  is  a  scalar  or  vector  function  of  position  in  space, 
we  may  write  Pott*,  Nowtt,  Pot**,  Lap»  for  the  volume-integrals  of 
potu',  etc,  taken  as  functions  of  p'\  Le.,  we  may  set 

Pot«-/J7pot  «W  =fff^LgW, 

New«-^newttW=^j4=^5uW, 

PoU-^pot  m'M  -ffftf=fito, 

where  the  p  is  to  be  regarded  as  constant  in  the  integration.  This 
extends  over  all  space,  or  wherever  the  u'  or  m  have  any  values 
other  than  sera  These  integrals  may  themselves  be  called  (integral) 
potentials,  Newtonians,  and  Laplacians. 

QQ  rfPotu    p  ,du        dPot*    «„*&• 

dx  dx  due  ax 

This  will  be  evident  with  respect  both  to  scalar  and  to  vector  functions, 
if  we  suppose  that  when  we  differentiate  the  potential  with  respect 
to  x  (thus  varying  the  position  of  the  point  for  which  the  potential 
is  taken)  each  element  of  volume  ck/  in  the  implied  integral  remains 
fixed,  not  in  absolute  position,  but  in  position  relative  to  the  point 
for  which  the  potential  is  taken.  This  supposition  is  evidently  allow- 
able whenever  the  integration  indicated  by  the  symbol  Pot  tends  to  a 
definite  limit  when  the  limits  of  integration  are  indefinitely  extended. 

Since  we  may  substitute  y  and  z  f or  x  in  the  preceding  formula, 
and  since  a  constant  factor  of  any  kind  may  be  introduced  under  the 
sign  of  integration,  we  have 

VPotu=PotVu, 

V.Pot««=PotV.», 

VxPot»=PptVx«, 

V.VPotu=PotV.Vu, 

V.VPot«=PotV.V», 

i.e.,  the  symbols  V,  V.,  Vx,  V.V  may  be  applied  indifferently  before 

or  after  the  sign  Pot. 

Tet  a  certain  restriction  is  to  be  observed.  When  the  operation  of 
taking  the  (integral)  potential  does  not  give  a  definite  finite  value, 
the  first  members  of  these  equations  are  to  be  regarded  as  entirely 
indeterminate,  but  the  second  members  may  have  perfectly  definite 
values.  This  would  be  the  case,  for  example,  if  u  or  «  had  a  constant 
value  throughout  all  space.  It  might  seem  harmless  to  set  an  inde- 
finite expression  equal  to  a  definite,  but  it  would  be  dangerous,  since 
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we  might  with  equal  right  set  the  indefinite  expression  equal  to  other 
definite  expressions,  and  then  be  misled  into  supposing  these  definite 
expressions  to  be  equal  to  one  another.  It  will  be  safe  to  say  that 
the  above  equations  will  hold,  provided  that  the  potential  of  u  or  & 
has  a  definite  value.  It  will  be  observed  that  whenever  Potu  or 
Pot  co  has  a  definite  value  in  general  (i.e.,  with  the  possible  exception 
of  certain  points,  lines,  and  surfaces),*  the  first  members  of  all  these 
equations  will  have  definite  values  in  general,  and  therefore  the 
second  members  of  the  equation,  being  necessarily  equal  to  the  first 
members,  when  these  have  definite  values,  will  also  have  definite 
values  in  general. 
94.  Again,  whenever  Potu  has  a  definite  value  we  may  write 


where  r 

stands  for 

[p- 

-/>],.    But 
r 

.*?, 

whence 

VPottt* 

= New  u. 

Moreover,  New  u  will  in  general  have  a  definite  valine,  if  Pot  u  has. 

95.  In  like  manner,  whenever  Pot  a  has  a  definite  value, 

VxPot«-Vx^(^=^^x^=^V^x«W. 

Substituting  the  value  of  V  -  given  above  we  have 

VxPoto)=Lapo). 

Lapw  will  have  a  definite  value  in  general  whenever  Pot »  has. 

96.  Hence,  with  the  aid  of  No.  93,  we  obtain 

VxLap»=LapVx«, 
V.Lap»=0, 

whenever  Pot  w  has  a  definite  value. 

97.  By  the  method  of  No.  93  we  obtain 

V.New  u= V.fff£^£  u'dv'=J?J'Vu\e^aW. 

To  find  the  value  of  this  integral,  we  may  regard  the  point  />,  which 
is  constant  in  the  integration,  as  the  center  of  polar  coordinates. 
Then  r  becomes  the  radius  vector  of  the  point  p,  and  we  may  set 

<&/=*•*  dgdr, 

*  Whenever  it  is  said  that  a  funotion  of  position  in  spaoe  has  a  definite  value  in 
general,  this  phrase  is  to  be  understood  as  explained  above.  The  term  definite  is 
intended  to  exolude  both  indeterminate  and  infinite  values. 


Google 
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where  r*dq  is  the  element  of  a  spherical  surface  having  center  at  p 
and  radius  r.    We  may  also  set 

r        dr 
We  thus  obtain 

V^ewu^//j/^dqdr^^J^dr^4^\mm^4^u'rm.9 

where  u  denotes  the  average  value  of  u  in  a  spherical  surface  of 
radius  r  about  the  point  p  as  center. 

Now  if  Potu  has  in  general  a  definite  value,  we  must  have  ff=«0 
for  r »  oo.  Also,  V.  New  u  will  have  in  general  a  definite  value.  For 
r=0,  the  value  of  u'  is  evidently  u.    We  have,  therefore, 

V.Newu=— 4xu, 

V.VPotu*-4rtfc* 

98.  If  Pot*  has  in  general  a  definite  value, 
V.V  Pot «»  V.V  Pot  [wi+yj+uk] 

= V.V  Pot  ui+ V.  V  Pot  vj + V.V  Pot  tt* 
■■  — 4rt*i— 4irt0— 4rtdb 
»  — 4ir». 
Hence,  by  No.  71, 

VxVxPot«-W.Pot»«4ir». 

Thftis,  LapVx«— NewV.»=4ir». 

If  we  set  1  t       rr  —  I  vt      tt 

•i«T-LapVx«,     «4«-TrrNewV.«, 

we  have  «««i+t4, 

where  «*  and  «,  are  such  functions  of  position  that  V.e^—0,  and 
Vxi4»0.  This  is  expressed  by  saying  that  «|  is  solenoidal,  and  a, 
tmrfotfonol.  Pot^  and  Pot»j,  like  Pot«,  will  have  in  general 
definite  values. 

It  is  worth  while  to  notice  that  there  is  only  one  way  in  which  a 
vector  function  of  position  in  space  having  a  definite  potential  can 
be  thus  divided  into  solenoidal  and  irrotational  parts  having  definite 
potentials.    For  if  *l+€,  «^— e  are  two  other  such  parts, 

V.e«0  and  Vx«»0. 

Moreover,  Pote  has  in  general  a  definite  value,  and  therefore 

€«T-LapVx€-T-NewV.€«a  Q.u>. 


that :  WFDt««jG()rj;V.V«*«j^ 
f/uV-.dc=-4m,    [MR  note  by  author.] 
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99.  To  assist  the  memory  of  the  student,  some  of  the  principal 
results  of  Noa  93-98  may  be  expressed  as  follows : 

Let  a>!  be  any  solenoidal  vector  function  of  position  in  space,  tog  any 
irrotational  vector  function,  and  u  any  scalar  function,  satisfying  the 
conditions  that  their  potentials  have  in  general  definite  values. 

With  respect  to  the  solenoidal  function  m^  j-  Lap  and  Vx  are 
inverse  operators ;  Le., 

Sj:LapVx»1  =  Vx^:Lap«1=»1. 

Applied  to  the  irrotational  function  »t,  either  of  these  operators  gives 
zero;  Le., 

Lapa>2=0,    Vxft),=0. 

With  respect  to  the  irrotational  function  «,,  or  the  scalar  function  u, 

x-  New  and  — V.  are  inverse  operators;  i.e., 

— j-New  V.«,=«,,    — V.j~Newu=u. 

Applied  to  the  solenoidal  function  wv  the  operator  V.  gives  zero ;  i.e. 

V.w^O. 
Since  the  most  general  form  of  a  vector  function  having  in  general 
a  definite  potential  may  be  written  c^+^s*  the  effect  of  these  operators 
on  such  a  function  needs  no  especial  mention. 

With  respect  to  the  solenoidal  function  o^,  j-  Pot  and  VxVx  are 
inverse  operators ;  Le., 

-^  Pot  VxVxa^Vx^j;  Pot  VXft^VxVx^j;  Pot  w^c^. 

With  respect  to  the  irrotational  function  »2,  7-  Pot  and  —  W.  are 
inverse  operators ;  Le., 

--^  Pot  VV.w^  -V^j;  Pot  V.a*=  -  VV.^Potw^w,. 

With  respect  to  any  scalar  or  vector  function  having  in  general  a 
definite  potential  j—  Pot  and  —  V.  V  are  inverse  operators ;  Le., 

-jj  Pot  V.Vtt  =  -  V.jjj;  Pot  Vu=  -  V.V^  Potu«ft, 

-^  Pot  V.Vfo+wJ*-  -  V.V^:Pot[«1+«J=»l+«2. 

With  respect  to  the  solenoidal  function  i^,  —  V.V  and  VxVx  are 
equivalent ;  with  respect  to  the  irrotational  function  cog  V.V  and  W 
are  equivalent;  Le., 

-V.Vo>1=VxVx»1t      V.Vo^^VV.wj. 
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100.  On  the  interpretation  of  the  preceding  formufa. — Infinite 
values  of  the  quantity  which  occurs  in  a  volume-integral  as  the 
coefficient  of  the  element  of  volume  will  not  necessarily  make  the 
value  of  the  integral  infinite,  when  they  are  confined  to  certain 
surfaces,  lines,  or  points.  Tet  these  surfaces,  lines,  or  points  may 
contribute  a  certain  finite  amount  to  the  value  of  the  volume-integral, 
which  must  be  separately  calculated,  and  in  the  case  of  surfaces  or 
lines  is  naturally  expressed  as  a  surface-  or  line-integral.  Such  cases 
are  easily  treated  by  substituting  for  the  surface,  line,  or  point,  a 
very  thin  shell,  or  filament,  or  a  solid  very  small  in  all  dimensions, 
within  which  the  function  may  be  supposed  to  have  a  very  large 
value. 

The  only  cases  which  we  shall  here  consider  in  detail  are  those  of 

surfaces  at  which  the  functions  of  position  (u  or  a>)  are  discontinuous, 

and  the  values  of  Vu,  Vx«,  V.a>  thus  become  infinite.     Let  the 

function  u  have  the  value  Uj  on  the  side  of  the  surface  which  we 

regard  as  the  negative,  and  the  value  u,  on  the  positive  side.    Let 

Att=f*j— Uj.    If  we  substitute  for  the  surface  a  shell  of  very  small 

thickness  a,  within  which  the  value  of  u  varies  uniformly  as  we  pass 

Vu 
through  the  shell,  we  shall  have  Vu=v  —  within  the  shell,  v  denoting 

a  unit  normal  on  the  positive  side  of  the  surface.  The  elements  of 
volume  which  compose  the  shell  may  be  expressed  by  a[der\9  where 
[cio-Jo  is  the  magnitude  of  an  element  of  the  surface,  cUr  being  the 
vector  element.    Hence, 

Vudv=vAu[do-]0s*Aud<T. 

Hence,  when  there  are  surfaces  at  which  the  values  of  u  are 
discontinuous,  the  full  value  of  Pot  Vu  should  always  be  understood 
as  including  the  surface-integral 

relating  to  such  surfaces.  (Am'  and  dtf  are  accented  in  the  formula 
to  indicate  that  they  relate  to  the  point  p.) 

In  the  case  of  a  vector  function  which  is  discontinuous  at  a  surface, 
the  expressions  V.adv  and  Vx»^,  relating  to  the  element  of  the 
shell  which  we  substitute  for  the  surface  of  discontinuity,  are  easily 
transformed  by  the  principle  that  these  expressions  are  the  direct 
and  skew  surface-integrals  of  »  for  the  element  of  the  shell  (See 
Nos,  55,  56.)  The  part  of  the  surface-integrals  relating  to  the  edge 
of  the  element  may  evidently  be  neglected,  and  we  Bhall  have 

Vx«dv»d<rx«4— da-x^i^dcxAw. 
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Whenever,  therefore,  a>  is  discontinuous  at  surfaces,  the  expressions 
PotV.tt  and  NewV.a  must  be  regarded  as  implicitly  including  the 
surface-integrals 

respectively,  relating  to  such  surfaces,  and  the  expressions  Pot  Vx«* 
and  Lap  Vx«  as  including  the  surface-integrals 

%'^d<r'xAw'  "*  # £=&&**** 

respectively,  relating  to  such  surfaces. 

101.  We  have  already  seen  that  if  w  is  the  curl  of  any  vector 
function  of  position,  V.a>™0.  (No.  68.)  The  converse  is  evidently 
true,  whenever  the  equation  V.w»0  holds  throughout  all  space,  and 
a)  has  in  general  a  definite  potential ;  for  then 

a>«Vxj-Lapa). 

Again,  if  V.«=0  within  any  aperiphractic  space  A,  contained 
within  finite  boundaries,  we  may  suppose  that  space  to  be  enclosed  by 
a  shell  B  having  its  inner  surface  coincident  with  the  surface  of  A. 
We  may  imagine  a  function  of  position  «',  such  that  a/=a>  in  A,  a/«0 
outside  of  the  shell  B,  and  the  integral  jjfw'.wdv  for  B  has  the  least 
value  consistent  with  the  conditions  that  the  normal  component  of  w 
at  the  outer  surface  is  zero,  and  at  the  inner  surface  is  equal  to  that 
of  a,  and  that  in  the  shell  W=0  (compare  No.  90).  Then  V.«>'»0 
throughout  all  space,  and  the  potential  of  w'  will  have  in  general  a 
definite  value.     Hence, 

«  mVxT-Lapw', 

and  o>  will  have  the  same  value  within  the  space  A. 

tl02.  Def. — If  u  is  a  vector  function  of  position  in  space,  the  Max- 
weUicm  *  of  »  is  a  scalar  function  of  position  defined  by  the  equation 

(Compare  No.  92.)    From  this  definition  the  following  properties  are 

easily  derived.    It  is  supposed  that  the  functions  w  and  u  are  such 

that  their  potentials  have  in  general  definite  values. 

Max  w  =x  V.  Pot  a)  a>  Pot  V.«, 

V  Max  oo = W.Pot  a  »  New  V.a>, 

Max  Vu  »  —  4»u,- 

4xa>*sVxLapa>— VMaxp. 

*  The  frequent  ooourrenoe  of  the  integral  in  MaxweU'a  TrecUue  on  EUetrieUp  cmd 
Magtutkm  ham  suggested  this  name. 
t(The  foregoing  portion  of  this  paper  was  printed  in  1881,  the  rest  in  1884.] 
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K  the  values  of  Lap  Lap  m,  New  Max  m,  and  Max  New  Hare  in  general 
definite,  we  may  add 

4*  Pot  *« Lap  Lap*— New  Max*, 

4xPotv«  -Max  New  u. 

In  other  words :  The  Maxwellian  is  the  divergence  of  the  potential, 

— -7—  and  V  are  inverse  operators  for  scalars  and  irrotatkmal  vectors, 

1 
for  vectors  in  general  —  j-VMaxisan  operator  which  separates  the 

irrotational  from  the  solenoidal  part  For  scalars  and  irrotational 
vectors,  ^-MaxNew  and  ^— New  Max  give  the  potential,  for  sole- 
noidal vectors  -t— Lap  Lap  gives  the  potential,  for  vectors 'in  general 

—  1  1 

-7— New  Max  gives  the  potential  of  the  irrotational  part  and  7— Lap 

Lap  the  potential  of  the  solenoidal  part 

103.  Def, — The  following  doable  volume-integrals  are  of  frequent 
occurrence  in  physical  problems.  They  are  all  scalar  quantities,  and 
none  of  them  functions  of  position  in  space,  as  are  the  single  volume- 
integrals  which  we  have  been  considering.  The  integrations  extend 
over  all  space,  or  as  far  as  the  expression  to  be  integiated  has  values 
other  than  aero. 

The  mutual  potential,  or  potential  product,  of  two  scalar  functions 
of  position  in  space  is  defined  by  the  equation 

Pot(u,  w)-fff/Jjr^<k<W 

In  the  double  volume-integral,  r  is  the  distance  between  the  two 
elements  of  volume,  and  u  relates  to  dv  as  u/  to  dif. 

The  mutual  potential,  or  potential  product,  of  two  vector  functions 
of  position  in  space  is  defined  by  the  equation 

The  mutual  Laplacian,  or  Laplaeian  product,  of  two  vector 
functions  of  position  in  space  is  denned  by  the  equation 

^{+.*)-Mff*^x+'<hd</ 

~fffm.l*$$dv**fff$.lM$*dv. 

The  Nexatonian  product  oi  a  scalar  and  a  vector  function  of  position 
in  space  is  denned  by  the  equation 

New(u,«)«j^^ 
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The  MaaweUicm  product  of  a  vector  and  a  scalar  function  of 
position  in  space  is  defined  by  the  equation 

It  is  of  course  supposed  that  utw}<j>}w  are  such  functions  of  position 
that  the  above  expressions  have  definite  values. 

104.  By  No.  97, 

4jruPott0=—  V.NewttPott0=—  V.[NewwPotw]  +  Newtt.Newm 

The  volume-integral  of  this  equation  gives 

4x  Pot  (w,  w)  =jQjy*New  u.  New  w  dv, 
if  the  integral 

j(7*d<r.New  u  Pot  wt 

for  a  closed  surface,  vanishes  when  the  space  included  by  the  surface 
is  indefinitely  extended  in  all  directions.  This  will  be  the  case  when 
everywhere  outside  of  certain  assignable  limits  the  values  of  u  and  w 
are  zero. 

Again,  by  No.  102, 
49ro).Pot  #= VxLap  o.Pot  0-  V  Max  a>.Pot  <p 
=  V.[Lapa>xPot^]  +  Lap  a). Lap  ^ 

—  V.[Max«Pot^]  +  MaxuMax^. 
The  volume-integral  of  this  equation  gives 

4x  Pot  (</>,  co)  =/X/*Lap  #.Lap  o>  dv  + jQj/'Max  ^  Max  o>  dv, 
if  the  integrals 

j^der.Lap  »  x  Pot  <f>,  ffdtr  J?ot  <f>  Max  w, 

for  a  closed  surface  vanish  when  the  space  included  by  the  surface  is 
indefinitely  extended  in  all  directions.  This  will  be  the  case  if  every- 
where outside  of  certain  assignable  limits  the  values  of  <f>  and  o>  are 
zero. 

CHAPTER  III. 

CONCERNING  LINEAR  VECTOR  FUNCTIONS. 

105.  Def. — A  vector  function  of  a  vector  is  said  to  be  linear,  when 
the  function  of  the  sum  of  any  two  vectors  is  equal  to  the  sum  of  the 
functions  of  the  vectors.    That  is,  if 

func.  0 + p] = func.  [p] + f una  [p] 

for  all  values  of  p  and  p',  the  function  is  linear.  In  such  cases  it  is 
easily  shown  that 

func.[ap+&//+cp"+etc.]=afimc.[>^ 
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106.  An  expression  of  the  form 

aX.p+/3/*.p+stc. 
evidently  represents  a  linear  function  of  p,  and  may  be  conveniently 
written  in  the  form 

{a\+/J/u+eta}.p. 

The  expression  p.aX+p./S/i+etc., 

or  p.{aA+/9/K+etc.}, 

.also  represents  a  linear  function  of  p,  which  is,  in  general,  different 
from  the  preceding,  and  will  be  called  its  conjugate* 

107.  Def. — An  expression  of  the  form  oA  or  fifi  will  be  called  a 
dyad.  An  expression  consisting  of  any  number  of  dyads  united  by 
the  signs  +  or  —  will  be  called  a  dyadic  binomial,  trinomial,  etc, 
as  the  case  may  be,  or  more  briefly,  a  dyadic.  The  latter  term  will 
be  used  so  as  to  include  the  case  of  a  single  dyad.  When  we  desire 
to  express  a  dyadic  by  a  single  letter,  the  Greek  capitals  will  be  used, 
except  such  as  are  like  the  Roman,  and  also  A  and  2.  The  letter  I 
will  also  be  used  to  represent  a  certain  dyadic,  to  be  mentioned 
hereafter. 

Since  any  linear  vector  function  may  be  expressed  by  means  of  a 
dyadic  (as  we  shall  see  more  particularly  hereafter,  see  No.  110),  the 
study  of  such  functions,  which  is  evidently  of  primary  importance  in 
the  theory  of  vectors,  may  be  reduced  to  that  of  dyadics. 

108.  Def. — Any  two  dyadics  $  and  ¥  are  equal, 

when  $.p**¥.p  for  all  values  of  p, 
or,  when  p.$=*p/¥k  for  all  values  of  p, 
or,  when   <r&.p=*cr&.p  for  all  values  of  <r  and  of  p. 

The  third  condition  is  easily  shown  to  be  equivalent  both  to  the  first 
and  to  the  second.    The  three  conditions  are  therefore  equivalent. 

It  follows  that  $=*,  if  $.p=¥.p,  or  p.$=xp.¥,  for  three  non- 
complanar  values  of  p. 

109.  Def. — We  shall  call  the  vector  $.p  the  (direct)  product  of 
$  and  p,  the  vector  p.$  the  (direct)  product  of  p  and  $,  and  the 
scalar  cr&.p  the  (direct)  product  of  <r,  $,  and  p. 

In  the  combination  $.p,  we  shall  say  that  $  is  used  as  a  prefaetor, 
in  the  combination  p.$,  as  a  postfaotor. 

110.  If  t  is  any  linear  function  of  p,  and  for  p=i,  p**j,  p=Jfe,  the 
values  of  t  are  respectively  a,  /8,  and  y,  we  may  set 

and  also  f .    ,.,>,,    , 

Therefore,  any  linear  function  may  be  expressed  by  a  dyadic  as 
prefaetor  and  also  by  a  dyadic  as  postfactor. 
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111.  Def. — We  shall  say  that  a  dyadic  is  multiplied  by  a  scalar, 
when  one  of  the  vectors  of  each  of  its  component  dyads  is  multiplied 
by  that  scalar.  It  is  evidently  immaterial  to  which  vector  of  any 
dyad  the  scalar  factor  is  applied.  The  product  of  the  dyadic  $  and 
the  scalar  a  may  be  written  either  a$  or  $a.  The  minus  sign  before 
a  dyadic  reverses  the  signs  of  all  its  terms. 

112.  The  sign  +  in  a  dyadic,  or  connecting  dyadics,  may  be 
regarded  as  expressing  addition,  since  the  combination  of  dyads  and 
dyadics  with  this  sign  is  subject  to  the  laws  of  association  and 
commutation. 

113.  The  combination  of  vectors  in  a  dyad  is  evidently  distributive. 

Tb**' W>     [a+jS+etaltX+M+et^l^aX+ttM+iSX+iSM+etc. 

We  may  therefore  regard  the  dyad  as  a  kind  of  product  of  the  two 
vectors  of  which  it  is  formed.  Since  this  kind  of  product  is  not 
commutative,  we  shall  have  occasion  to  distinguish  the  factors  as 
antecedent  and  consequent. 

114.  Since  any  vector  may  be  expressed  as  a  sum  of  i,j,  andJfe  with 
scalar  coefficients,  every  dyadic  may  be  reduced  to  a  sum  of  the  nine 

7  ti,  ijt  tie,  ji,  jj,  jk,  hi,  fyt  **, 

with  scalar  coefficients.  Two  such  sums  cannot  be  equal  according  to 
the  definitions  of  No.  108,  unless  their  coefficients  are  equal  each  to 
each.  Hence  dyadics  are  equal  only  when  their  equality  can  be 
deduced  from  the  principle  that  the  operation  of  forming  a  dyad  is  a 
distributive  one. 

On  this  account,  we  may  regard  the  dyad  as  the  most  general  form 
of  product  of  two  vectors.  We  shall  call  it  the  indeterminate  product 
The  complete  determination  of  a  single  dyad  involves  five  independent 
scalars,  of  a  dyadic,  nine. 

115.  It  follows  from  the  principles  of  the  last  paragraph  that  if 

2a/3«2irX, 
**"  2ax£-2*xX, 

In  other  words,  the  vector  and  the  scalar  obtained  from  a  dyadic 
by  insertion  of  the  sign  of  skew  or  direct  multiplication  in  each  dyad 
are  both  independent  of  the  particular  form  in  which  the  dyadic  is 


We  shall  write  $x  and  #g  to  indicate  the  vector  and  the  scalar  thus 
obtained. 

$8  »  i&.i  + j.$.  j + k&.  Jfc, 
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as  is  at  once  evident,  if  we  suppose  $  to  be  expanded  in  terms  of 
it,  ij,  etc. 

116.  Def. — The  (direct)  product  of  two  dyads  (indicated  by  a  dot) 
is  the  dyad  formed  of  the  first  and  last  of  the  four  factors,  multiplied 
by  the  direct  product  of  the  second  and  third.    That  is, 

{a/3}.  {yS}  =*  a  /8.y  S  « )8.y  a&. 
The  (direct)  product  of  two  dyadics  is  the  sum  of  all  the  products 
formed  by  prefixing  a  term  of  the  first  dyadic  to  a  term  of  the  second. 
Since  the  direct  product  of  one  dyadic  with  another  is  a  dyadic,  it 
may  be  multiplied  in  the  same  way  by  a  third,  and  so  on  indefinitely. 
This  kind  of  multiplication  is  evidently  associative,  as  well  as  dis- 
tributive. The  same  is  true  of  the  direct  product  of  a  series  of  factors 
of  which  the  first  and  the  last  are  either  dyadics  or  vectors,  and  the 
other  factors  are  dyadics.    Thus  the  values  of  the  expressions 

a.#.e.*.i8,    a.#.e,    *.e.*.j8,    #.e.* 

will  not  be  affected  by  any  insertion  of  parentheses.  But  this  kind  of 
multiplication  is  not  commutative,  except  in  the  case  of  the  direct 
product  of  two  vectors. 

117.  Def — The  expressions  $xp  and  px$  represent  dyadics  which 
we  shall  call  the  skew  products  of  $  and  p.    If 

$=aX+j8/*+etc., 
these  skew  products  are  defined  by  the  equations 
*Xp=aXxp+/8/KXp+etc, 
pX*=pXaX+pXj8/u+etc 
It  is  evident  that 

{px*}.*=px  {*.*},  ¥.{$xp}  -  {*.$}xp, 

{pX$}.a=px[$.a],  a.{$Xp}=[a.$]xp, 

{pX#}Xa«pX{*Xa}. 
We  may  therefore  write  without  ambiguity 

pX$.¥,     ¥.#Xp,     pX*.a,     a&Xp,     pxi.a. 
This  may  be  expressed  a  little  more  generally  by  saying  that  the 
associative  principle  enunciated  in  No.  116  may  be  extended  to  cases 
in  which  the  initial  or  final  vectors  are  connected  with  the  other 
factors  by  the  sign  of  skew  multiplication. 
Moreover, 

a.pX$=[aXp].$  and  $Xp.a=*$.[pXa]. 
These  expressions  evidently  represent  vectors.    So 

¥.{px$}  =  {¥xp}.*. 
These  expressions  represent  dyadics,     The  braces  cannot  be  omitted 
without  ambiguity. 
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118.  Since  all  the  antecedents  or  all  the  consequents  in  any  dyadic 
may  be  expressed  in  parts  of  any  three  non-complanar  vectors,  and 
since  the  sum  of  any  number  of  dyads  having  the  same  antecedent 
or  the  same  consequent  may  be  expressed  by  a  single  dyad,  it  follows 
that  any  dyadic  may  be  expressed  as  the  sum  of  three  dyads,  and  so, 
that  either  the  antecedents  or  the  consequents  shall  be  any  desired 
non-complanar  vectors,  but  only  in  one  way  when  either  the  ante- 
cedents or  the  consequents  are  thus  given. 
In  particular,  the  dyadic 

avi+bij  +ci& 
+a'ji  +b'jj  +c'jk 
+a''H+&''A?>c''JfcJfc, 
which  may  for  brevity  be  written 


a      b      c 

- 

a'     V     c' 

*» 

a"    6"    c" 

is  equal  to 

ai+ffi+yk, 

where 

a=ai+a'j+a"k9 

/3=bi+b'j+b*k, 

y=ci +<#+<?"&, 

and  to 

*X +#!  +  **, 

where 

\=ai  +6;  +ck 

li=a'i+b'j+cfk 

V 

=  a"i+6";  + 

c"k 

119.  By  a  similar  process,  the  sum  of  three  dyads  may  be  reduced 
to  the  sum  of  two  dyads,  whenever  either  the  antecedents  or  the 
consequents  are  complanar,  and  only  in  such  cases.  To  prove  the 
latter  point,  let  us  suppose  that  in  the  dyadic 

aX+jfyt+yF 

neither  the  antecedents  nor  the  consequents  are  complanar.     The 

vector  {aX+Au+yv}.p 

is  a  linear  function  of  p  which  will  bo  parallel  to  a  when  p  is  perpen- 
dicular to  ft  and  v,  which  will  be  parallel  to  ft  when  p  is  perpendicular 
to  v  and  X,  and  which  will  be  parallel  to  y  when  p  is  perpendicular  to 
X  and  fi.  Hence,  the  function  may  be  given  any  value  whatever  by 
giving  the  proper  value  to  p.  This  would  evidently  not  be  the  case 
with  the  sum  of  two  dyads.  Hence,  by  No.  108,  this  dyadic  cannot  be 
equal  to  the  sum  of  two  dyads. 
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190.  In  like  manner,  the  sum  of  two  dyads  may  be  reduced  to  a 
single  dyad,  if  either  the  antecedents  or  the  consequents  are  parallel, 
and  only  in  such  cases. 

A  sum  of  three  dyads  cannot  be  reduced  to  a  single  dyad,  unless 
either  their  antecedents  or  consequents  are  parallel,  or  both  ante- 
cedents and  consequents  are  (separately)  complanar.  In  the  first  case 
the  reduction  can  always  be  made,  in  the  second,  occasionally. 

121.  Def. — A  dyadic  which  cannot  be  reduced  to  the  sum  of  less 
than  three  dyads  will  be  called  complete. 

A  dyadic  which  can  be  reduced  to  the  sum  of  two  dyads  will  be 
called  planar.  When  the  plane  of  the  antecedents  coincides  with 
that  of  the  consequents,  the  dyadic  will  be  called  wniplanar.  These 
planes  are  invariable  for  a  given  dyadic,  although  the  dyadic  may  be 
so  expressed  that  either  the  two  antecedents  or  the  two  consequents 
may  have  any  desired  values  (which  are  not  parallel)  within  their 
planes. 

A  dyadic  which  can  be  reduced  to  a  single  dyad  will  be  called 
linear.  When  the  antecedent  and  consequent  are  parallel,  it  will  be 
called  unilinear  . 

A  dyadic  is  said  to  have  the  value  zero  when  all  its  terms  vanish. 

122.  If  we  set  .  , 

<r»$.p,        T—p.T, 

and  give  p  all  possible  values,  <r  and  t  will  receive  all  possible  values, 
if  $  is  complete.  The  values  of  cr  and  t  will  be  confined  each  to  a 
plane  if  i  is  planar,  which  planes  will  coincide  if  $  is  uniplanar. 
The  values  of  <r  and  t  will  be  confined  each  to  a  line  if  $  is  linear, 
which  lines  will  coincide  if  $  is  unilinear. 

123.  The  products  of  complete  dyadics  are  complete,  of  complete 
and  planar  dyadics  are  planar,  of  complete  and  linear  dyadics  are 
linear. 

The  products  of  planar  dyadics  are  planar,  except  that  when  the 
plane  of  the  consequents  of  the  first  dyadic  is  perpendicular  to  the 
plane  of  the  antecedents  of  the  second  dyadic,  the  product  reduces 
to  a  linear  dyadic. 

The  products  of  linear  dyadics  are  linear,  except  that  when  the 
consequent  of  the  first  is  perpendicular  to  the  antecedent  of  the 
second,  the  product  reduces  to  zero. 

The  products  of  planar  and  linear  dyadics  are  linear,  except  when, 
the  planar  preceding,  the  plane  of  its  consequents  is  perpendicular  to 
the  antecedent  of  the  linear,  or,  the  linear  preceding,  its  consequent  is 
perpendicular  to  the  plane  of  the  antecedents  of  the  planar.  In  these 
cases  the  product  is  zero. 

All  these  caste  are  readily  proved,  if  we  set 
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and  consider  the  limits  within  which  <r  varies,  when  we  give  p  all 
possible  values. 

The  products  ^xp  and  px$  are  evidently  planar  dyadics. 

124  Def. — A  dyadic  $  is  said  to  be  an  idemfactor,  when 

$.p**p  for  all  values  of  p, 
or  when  p.§>=*p  for  all  values  of  p. 

If  either  of  these  conditions  holds  true,  $  must  be  reducible  to  the 
form  ••,••.  77 

Therefore,  both  conditions  will  hold,  if  either  does.    All  such  dyadics 

are  equal,  by  No.  108.    They  will  be  represented  by  the  letter  L 

The  direct  product  of  an  idemfactor  with  another  dyadic  is  equal  to 

that  dyadic.    That  is, 

L3>=$,        $.I=#, 

where  $  is  any  dyadic. 

A  dyadic  of  the  form  ,  ,  DQ,  ,       , 

act  -rpp +yy$ 

in  which  a ,  f¥%  y  are  the  reciprocals  of  a,  £,  y,  is  an  idemfactor. 
(See  No.  38.)  A  dyadic  trinomial  cannot  be  an  idemfactor,  unless  its 
antecedents  and  consequents  are  reciprocate. 

125.  If  one  of  the  direct  products  of  two  dyadics  is  an  idemfactor, 
the  other  is  also.    For,  if  &Mr = I, 

for  aU  values  of  <r,  and  $  is  complete ; 

for  all  values  of  <r,  therefore  for  all  values  of  o*.$,  and  therefore 

Def — In  this  case,  either  dyadic  is  called  the  reciprocal  of  the 
other. 

It  is  evident  that  an  incomplete  dyadic  cannot  have  any  (finite) 
reciprocal. 

Reciprocals  of  the  same  dyadic  are  equal  For  if  $  and  ¥  are  both 
reciprocals  of  Q,  $=*.Q.¥=¥. 

If  two  dyadics  are  reciprocals,  the  operators  formed  by  using  these 
dyadics  as  prefactors  are  inverse,  also  the  operators  formed  by  using 
them  as  postfactors. 

126.  The  reciprocal  of  any  complete  dyadic 
dk+fi/jL+yv 
is  \'a'+jj,'f¥+v'y, 

where  a ,  jff,  y  are  the  reciprocals  of  a,  £,  y,  and  A',  /i'»  v  ***  $& 
reciprocals  of  X,  /a,  v.    (See  No.  38.) 
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127.  Def. — We  shall  write  $~l  for  the  reciprocal  of  any  (complete) 
dyadic  *,  also  $*  for  $.$,  etc,  and  $-f,  for  **1.$-1,  etc  It  is 
evident  that  $~*  is  the  reciprocal  of  $", 

128.  In  the  redaction  of  equations,  if  we  have 

we  may  cancel  the  *  (which  is  equivalent  to  multiplying  by  $-*)  if 
$  is  a  complete  dyadic,  but  not  otherwise  The  case  is  the  same  with 
such  equations  as 

*.or=*.p,    ¥.$«Q.*,    p.#«<r.*. 

To  cancel  an  incomplete  dyadic  in  such  cases  would  be  analogous  to 
cancelling  a  zero  factor  in  algebra. 

129.  Def. — If  in  any  dyadic  we  transpose  the  factors  in  each  term, 
the  dyadic  thus  formed  is  said  to  be  conjugate  to  the  first    Thus 

aA+/8/K+yv  and  Xa+/Aj8+Ky 

are  conjugate  to  each  other.  A  dyadic  of  which  the  value  is  not 
altered  by  such  transposition  is  said  to  be  edf-wnjugate.  The  con- 
jugate of  any  dyadic  *  may  be  written  $<>    It  is  evident  that 

$0.p  and  $.p  are  conjugate  functions  of  p.  (See  No.  106.)  Since 
{$<,}*=  {#*}<)>  we  may  write  #§,  etc,  without  ambiguity. 

180.  The  reciprocal  of  the  product  of  any  number  of  dyadics  is 
equal  to  the  product  of  their  reciprocals  taken  in  inverse  order.    Thus 

The  conjugate  of  the  product  of  any  number  of  dyadics  is  equal  to 
the  product  of  their  conjugates  taken  in  inverse  order.    Thus 

{*.*.0}o»Qo*o*o. 
Hence,  since  ^o-l*"1}©18^*"1'*}©""!* 

and  we  may  write  $£*  without  ambiguity* 

181.  It  is  sometimes  convenient  to  be  able  to  express  by  a  dyadic 
taken  in  direct  multiplication  the  same  operation  which  would  be 
effected  by  a  given  vector  (a)  in  skew  multiplication.  The  dyadic 
Ixa  will  answer  this  purpose.    For,  by  No.  117, 

{Ixa}.p=aXp,  p.{Ixa}  =*pXa, 

{Ixa}.$=*aX$,  $.{Ixa}  =#Xou 

The  same  is  true  of  the  dyadic  axl,  which  is  indeed  identical  with 
Ixa,  as  appears  from  the  equation  I. {axl}  =  {Ixa}. I. 
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If  a  is  a  unit  vector, 

{Ixa}«=-{I-aa}, 
{Ixa}*=-Ixa, 
{Ixa}4=I— act, 
{Ixa}*  =  lxa, 
etc. 

If  itj,  k  are  a  normal  system  of  unit  vectors 

Ixi=ixI=A#  — jk. 
Ixj  =jxl=ik—ki, 
Ixk=kxl=ji —ij. 
If  a  and  /8  are  any  vectors, 

[aX/8]xI=Ix[aXj8] =/8a  -a£ 
That  is,  the  vector  aXj8  as  a  pre-  or  post-factor  in  skew  multipli- 
cation is  equivalent  to  the  dyadic  {/8a— a/9}  taken  as  pre-  or  post- 
factor  in  direct  multiplication. 

[aXj8]xp=:{j8a-a£}./>, 
px[aXj8]=p.{j8a-aj8}. 

This  is  essentially  the  theorem  of  No.  27,  expressed  in  a  form  more 
symmetrical,  and  more  easily  remembered. 
132.  The  equation 

a  j8xy  +  /8yXa  +  yaX)8=a.j8xyI . 
gives,  on  multiplication  by  any  vector  p,  the  identical  equation 

p.a/8xy  +  p.jSyXa  +  p.y  aX/8=a./8xyp. 

(See  No.  87.)  The  former  equation  is  therefore  identically  true. 
(See  No.  108.)    It  is  a  little  more  general  than  the  equation 

aa'+)fy8/+yy,=I, 

which  we  have  already  considered  (No.  124),  since,  in  the  form  here 
given,  it  is  not  necessary  that  a,  /3,  and  y  should  be  non-complanar. 
We  may  also  write 

/8xya+yXaj8+aXj8y=a.j8xyl. 
Multiplying  this  equation  by  p  as  prefactor  (or  the  first  equation  by 
p  as  postfactor),  we  obtain 

p./8xy  a  +  p.yxajS  +  p.ax/3y=a.£xyp. 
(Compare  No.  37.)    For  three  complanar  vectors  we  have 

a/8xy  +  jSyXa  +  yaX/8=0. 
Multiplying  this  by  v,  a  unit  normal  to  the  plane  of  a,  j8,  and  y 
we  have 

afixy.v  +  pyxa.v  +  y  aX/8.v=0. 
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This  equation  expresses  the  well-known  theorem  that  if  the  geometrical 
sum  of  three  vectors  is  zero,  the  magnitude  of  each  vector  is  propor- 
tional to  the  sine  of  the  angle  between  the  other  two.  It  also  indicates 
the  numerical  coefficients  by  which  one  of  three  complanar  vectors  may 
be  expressed  in  parts  of  the  other  two. 

133.  Def. — If  two  dyadics  $  and  ¥  are  such  that 

they  are  said  to  be  Jiomdlogoua. 

If  any  number  of  dyadics  are  homologous  to  one  another,  and  any 
other  dyadics  are  formed  from  them  by  the  operations  of  taking 
multiples,  sums,  differences,  powers,  reciprocals,  or  products,  such 
dyadics  will  be  homologous  to  each  other  and  to  the  original  dyadics. 
This  requires  demonstration  only  in  regard  to  reciprocals.    Now  if 

That  is,  $~*  is  homologous  to  ¥,  if  $  is. 

134.  If  we  call  "ir.i"1  or  i"1.*  the  quotient  of  ¥  and  *,  we  may 
say  that  the  rules  of  addition,  subtraction,  multiplication  and  division 
of  homologous  dyadics  are  identical  with  those  of  arithmetic  or 
ordinary  algebra,  except  that  limitations  analogous  to  those  respecting 
zero  in  algebra  .must  be  observed  with  respect  to  all  incomplete 
dyadics. 

It  follows  that  the  algebraic  and  higher  analysis  of  homologous 
dyadics  is  substantially  identical  with  that  of  scalars. 

135.  It  is  always  possible  to  express  a  dyadic  in  three  terms,  so 
that  both  the  antecedents  and  the  consequents  shall  be  perpendicular 
among  themselves. 

To  show  this  for  any  dyadic  $,  let  us  set 

p  being  a  unit-vector,  and  consider  the  different  values  of  p'  for  all 
possible  directions  of  p.  Let  the  direction  of  the  unit  vector  i  be  so 
determined  that  when  p  coincides  with  i,  the  value  of  p  shall  be  at 
least  as  great  as  for  any  other  direction  of  p.  And  let  the  direction 
of  the  unit  vector  j  be  so  determined  that  when  p  coincides  with  j, 
the  value  of  p'  shall  be  at  least  as  great  as  for  any  other  direction  of  p 
which  is  perpendicular  to  i  Let  k  have  its  usual  position  with 
respect  to  i  and  j.    It  is  evidently  possible  to  express  *  in  the  form 

We  have  therefore 

p'«{ai+#+yi}.p, 

and  dp'«{at+#+yi}.<*p. 
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Now  the  supposed  property  of  the  direction  of  t  requires  that  when  p 
coincides  with  t  and  dp  is  perpendicular  to  i,  dp'  shall  be  perpendicular 
to  />',  which  will  then  be  parallel  to  a.  But  if  dp  is  parallel  to  j  or  ib, 
it  will  be  perpendicular  to  i,  and  dp  will  be  parallel  to  #  or  y,  as  the 
case  may  be.  Therefore  /3  and  y  are  perpendicular  to  a.  In  the  same 
way  it  may  be  shown  that  the  condition  relative  to  j  requires  that  y 
shall  be  perpendicular  to  j8.    We  may  therefore  set 

where  i\  j',  #,  like  i,  j,  h,  constitute  a  normal  system  of  unit  vectors 
(see  No.  11),  and  a,  6,  c  are  scalars  which  may  be  either  positive  or 
negative. 

It  makes  an  important  difference  whether  the  number  of  these 
scalars  which  are  negative  is  even  or  odd.  If  two  are  negative,  say  a 
and  6,  we  may  make  them  positive  by  reversing  the  directions  of  i' 
and  /.  The  vectors  i\  j',  kf  will  still  constitute  a  normal  system. 
But  if  we  should  reverse  the  directions  of  an  odd  number  of  these 
vectors,  they  would  cease  to  constitute  a  normal  system,  and  to  be 
superposable  upon  the  system  i,  j,  k.    We  may,  however,  always  set 

dther  $=wi'i+b}'j+cl<fh 

or  $=»  -{at'i+fc#+c*'*}> 

with  positive  values  of  a,  6,  and  c.    At  the  limit  between  these  cases 

are  the  planar  dyadics,  in  which*  one  of  the  three  terms  vanishes,  and 

the  dyadic  reduces  to  the  form 

ai'i+bj'j, 
in  which  a  and  b  may  always  be  made  positive  by  giving  the  proper 
directions  to  i'  and  /. 

If  the  numerical  values  of  a,  6,  c  are  all  unequal,  there  will  be  only 
one  way  in  which  the  value  of  $  may  be  thus  expressed.  If  they  are 
not  all  unequal,  there  will  be  an  infinite  number  of  ways  in  which  $ 
may  be  thus  expressed,  in  all  of  which  the  three  scalar  coefficients 
will  have  the  same  values  with  exception  of  the  changes  of  signs 
mentioned  above.  If  the  three  values  are  numerically  identical,  we 
may  give  to  either  system  of  normal  vectors  an  arbitrary  position. 

136.  It  follows  that  any  self-conjugate  dyadic  may  be  expressed 
in  the  form  aii+ljj+ckk, 

where  it  j,  k  are  a  normal  system  of  unit  vectors,  and  a,  6,  c  are 
positive  or  negative  scalars. 

137.  Any  dyadic  may  be  divided  into  two  parts,  of  which  one  shall 
be  self-conjugate,  and  the  other  of  the  form  Ixa.  These  parts  are 
found  by  taking  half  the  sum  and  half  the  difference  of  the  dyadic 
and  its  conjugate.    It  is  evident  that 
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Now  ${#+$<,}  is  self -conjugate,  and 

(See  No.  131.) 

Rotations  ana  Strain* 

138.  To  illustrate  the  nee  of  dyadics  as  operators,  let  us  suppose 
that  a  body  receives  such  a  displacement  that 

p  and  p  being  the  position-vectors  of  the  same  point  of  the  body  in 
its  initial  and  subsequent  positions.  The  same  relation  will  hold  of 
the  vectors  which  unite  any  two  points  of  the  body  in  their  initial 
and  subsequent  positions.  For  if  p19  pt  are  the  original  position- 
vectors  of  the  points,  and  p{>  />,'  their  final  position-vectors,  we  have 

Pi'-*Pi»        /*'«#./*, 
whence  ,       ,    ,r  , 

Pt  -Pi  =*LPi-piJ- 

In  the  most  general  case,  the  body  is  said  to  receive  a  homogeneous 
strain.  In  special  cases,  the  displacement  reduces  to  a  rotation, 
lines  in  the  body  initially  straight  and  parallel  will  be  straight  and 
parallel  after  the  displacement,  and  surfaces  initially  plane  and 
parallel  will  be  plane  and  parallel  after  the  displacement. 

189.  The  vectors  (<r,  cr)  which  represent  any  plane  surface  in  the 
body  in  its  initial  and  final  positions  will  be  linear  functions  of  each 
other.  (This  will  appear,  if  we  consider  the  four  sides  of  a  tetra- 
hedron in  the  body.)  To  find  the  relation  of  the  dyadics  which 
express  </  as  a  function  of  <r,  and  p  as  a  function  of  />,  let 

p'«{aX+£A*+yF}./>. 

Then,  if  we  write  X',  //,  v  for  the  reciprocals  of  X,  /a,  v,  the  vectors 
X',  ft,  v  become  by  the  strain  a,  /3,  y.  Therefore  the  surfaces  /z'x*', 
ZxX',  X'x/a'  become  j8xy,  yxa,  aXj8.  But  /»'xr ,  v'xX',  X'x/i'  are  the 
reciprocals  of  fiXv,  vx\,  Xxa*.    The  relation  sought  is  therefore 

<r'«  {/8xy  fiXv  +  yxa  *xX  +  aXj8  Xxm}-<t. 

140.  The  volume  X'./i'x*'  becomes  by  the  strain  a./3xy.  The 
unit  of  volume  becomes  therefore  (a.)8xy)(X.AiXv). 

Def. — It  follows  that  the  scalar  product  of  the  three  antecedents 
multiplied  by  the  scalar  product  of  the  three  consequents  of  a  dyadic 
expressed  as  a  trinomial  is  independent  of  the  particular  form  in 
which  the  dyadic  is  thus  expressed.  This  quantity  is  the  determinant 
of  the  coefficients  of  the  nine  terms  of  the  form 

aii+bvj+ebc, 
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into  which  the  dyadic  may  be  expanded.     We  shall  call  it  the 
determinant  of  the  dyadic,  and  shall  denote  it  by  the  notation 

1*1 

when  the  dyadic  is  expressed  by  a  single  letter. 

If  a  dyadic  is  incomplete,  its  determinant  is  zero,  and  conversely. 

The  determinant  of  the  product  of  any  number  of  dyadics  is 
equal  to  the  product  of  their  determinants.  The  determinant  of  the 
reciprocal  of  a  dyadic  is  the  reciprocal  of  the  determinant  of  that 
dyadic    The  determinants  of  a  dyadic  and  its  conjugate  are  equal. 

The  relation  of  the  surfaces  <r  and  <r  may  be  expressed  by  the  equation 

tr'-l*!**)-1.** 

141.  Let  us  now  consider  the  different  cases  of  rotation  and  strain 

as  determined  by  the  nature  of  the  dyadic  $. 
If  $  is  reducible  to  the  form 

i'i+j'j+Vk, 
i>j>  k  i',j',  kf  being  normal  systems  of  unit  vectors  (see  No.  11),  the 
body  will  suffer  no  change  of  form.    For  if 

p=xi+yj+zk, 
we  shall  have 

P~xi'+yj'+9tf. 
Conversely,  if  the  body  suffers  no  change  of  form,  the  operating 
dyadic  is  reducible  to  the  above  form.  In  such  cases,  it  appears  from 
simple  geometrical  considerations  that  the  displacement  of  the  body 
may  be  produced  by  a  rotation  about  a  certain  axis.  A  dyadic 
reducible  to  the  form 

i'i+j'j+Kk 
may  therefore  be  called  a  versor. 

142.  The  conjugate  operator  evidently  produces  the  reverse  rotation. 
A  versor,  therefore,  is  the  reciprocal  of  its  conjugate. 

Conversely,  if  a  dyadic  is  the  reciprocal  of  its  conjugate,  it  is  either 
a  versor,  or  a  versor  multiplied  by  —  1.  For  the  dyadic  may  be 
expressed  in  the  form 

Its  conjugate  will  be  ia+jjl+ky. 

If  these  are  reciprocals,  we  have 

{ai+&+yk}.{ia+JI3+ty}  =  aa+pp+yy-L 
But  this  relation  cannot  subsist  unless  a,  /8,  y  are  reciprocals  to 
themselves,  Le.,  unless  they  are  mutually  perpendicular  unit-vectors. 
Therefore,  they  either  are  a  normal  system  of  unit- vectors,  or  will 
become  such  if  their  directions  are  reversed.  Therefore,  one  of  the 
dyadics 

ai+ffi+yk    and     -at-#-yife 
is  a  versor. 


*[8eenoteonp.  90.] 
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The  criterion  of  a  versor  may  therefore  be  written 
*.*o-I,    and    |*|-1. 
For  the  last  equation  we  may  substitute 

|*|>0,    or    |*|5-1. 

It  is  evident  that  the  resultant  of  successive  finite  rotations  is 
obtained  by  multiplication  of  the  versors. 

143.  If  we  take  the  axis  of  the  rotation  for  the  direction  of  i, 
i'  will  have  the  same  direction,  and  the  versor  reduces  to  the  form 

K+fj+M, 
in  which  %,  j9  k  and  %,]',  V  are  normal  systems  of  unit  vectors. 
We  may  set 

j'**eoeqj  +  mnqk, 

tf—eoBqk  —  mnqjr 
and  the  versor  reduces  to 

ii  +  00S?  {#+**}  +  sing  {#-j*}> 
or 

ii  +  cosg  {I -it}  +  singlxi, 

where  q  is  the  angle  of  rotation,  measured  from  j  toward  Jb,  if  the 
versor  is  used  as  a  prefactor. 

144.  When  any  versor  *  is  used  as  a  prefactor,  the  vector  —  $x 
will  be  parallel  to  the  axis  of  rotation,  and  equal  in  magnitude  to 
twice  the  sine  of  the  angle  of  rotation  measured  counter-clockwise  as 
seen  from  the  direction  in  which  the  vector  points.  (This  will  appear 
if  we  suppose  *  to  be  represented  in  the  form  given  in  the  last 
paragraph.)  The  scalar  *8  will  be  equal  to  unity  increased  by  twice 
the  cosine  of  the  same  angle.  Together,  —  *x  and  $fe  determine  the 
versor  without  ambiguity.    If  we  set 

9  1+*.' 

the  magnitude  of  6  will  be 

2sing 
0  ,  0    *       or   tan  Jo, 

where  q  is  measured  counter-clockwise  as  seen  from  the  direction  in 
which  0  points.  This  vector  0,  which  we  may  call  the  vector  eemi- 
tangent  of  version,  determines  the  versor  without  ambiguity. 

145.  The  versor  $  may  be  expressed  in  terms  of  6  in  various  ways. 
Since  *  (as  prefactor)  changes  a—6xa  into  a+dxa  (a  being  any 
vector),  we  have 

S-fl+IxdMI-Ixd}-1. 

-  ee+{i+ixfl}*  (1-0.0)1+230+21x0 

*"      1+0T5      "  T+83  » 
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as  will  be  evident  on  considering  separately  in  the  expression  #.p 
the  components  perpendicular  and  parallel  to  ft  or  on  substituting  in 

<i  +  caBq(jj+kk)  +  Bmq(fy—jk) 

for  oooq  and  sing  their  values  in  terms  of  tan  J^. 
If  we  set,  in  either  of  these  equations, 

6**ai+bj+ck, 

we  obtain,  on  reduction,  the  formula 

r(l+a«-6i-<j«)^4.(2a6-2c)v>(2ac+26)i*      ) 
\  +(2ab+2c)ji+(l-a*+V-<fyjj+(2be-2a)jk  \ 

in  which  the  versor  is  expressed  in  terms  of  the  rectangular  com- 
ponents of  the  vector  semitangent  of  version. 

146.  If  a,  /8,  y  are  unit  vectors,  expressions  of  the  form 

2aa-I,       2/8/8-1,       2yy-I, 

are  biquadrantal  versors.    A  product  like 

{2#8-I}.{2aa-I} 

is  a  versor  of  which  the  axis  is  perpendicular  to  a  and  /3,  and  the 
amount  of  rotation  twice  that  which  would  carry  a  to  /8.  It  is 
evident  that  any  versor  may  be  thus  expressed,  and  that  either  a  or  £ 
may  be  given  any  direction  perpendicular  to  the  axis  of  rotation.    If 

#-{2/30-1}. {2aa-I},    and    ¥={2yy-I}.{2/30-I}, 
we  have  for  the  resultant  of  the  successive  rotations 

¥.$»{2yy-I}.{2aa-I}. 
This  may  be  applied  to  the  composition  of  any  two  successive 
rotations,  /8  being  taken  perpendicular  to  the  two  axes  of  rotation, 
and  affords  the  means  of  determining  the  resultant  rotation  by 
construction  on  the  surface  of  a  sphere.  It  also  furnishes  a  simple 
method  of  finding  the  relations  of  the  vector  semitangents  of  version 
for  the  versors  #,  ¥,  and  ¥.$.    Let 

a  _  "*x  a  _  —  xx  A  _  —  {y.$}x 

*   I+5f        f    T+¥*       *i+ {*.*}■' 
Then,  since 

$«4a.0£a-2aa-2#8+I, 

which  is  moreover  geometrically  evident    In  like  manner, 

,0* 


P-Y  "Y 
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Therefore, 

^j8.a/8xy+fl./8yXa+/8.yaxfl 
—  *-ftfty 

(SeeNaSa)    That  is, 

Also,  ^.SSftto.i-     •3L.. 

Hence,  e1xe1«e,-(i-Mt)*»+^ 

A       Or  +  0,  +  OtXtf! 

*~     i-Mf     • 

which  is  the  formula  for  the  composition  of  successive  finite  rotations 
by  means  of  their  vector  semitangents  of  version. 

147.  The  versors  just  described  constitute  a  particular  class  under 
the  more  general  form 

aa'  +  cos  q  {fiP+yy'}  +  sin  q  {y/ff-By'} , 

in  which  a,  ft,  y  are  any  non-oomplanar  vectors,  and  a,  fjf,  y  their 
reciprocals.  A  dyadic  of  this  form  as  a  prefactor  does  not  affect  any 
vector  parallel  to  a.  Its  effect  on  a  vector  in  the  £-y  plane  will  be 
best  understood  if  we  imagine  an  ellipse  to  be  described  of  which 
ft  and  y  are  conjugate  semi-diameters.  If  the  vector  to  be  operated 
on  be  a  radius  of  this  ellipse,  we  may  evidently  regard  the  ellipse 
with  ft,  y,  and  the  other  vector,  as  the  projections  of  a  circle  with  two 
perpendicular  radii  and  one  other  radius.  A  little  consideration  will 
show  that  if  the  third  radius  of  the  circle  is  advanced  an  angle  q,  its 
projection  in  the  ellipse  will  be  advanced  as  required  by  the  dyadic 
prefactor.  The  effect,  therefore,  of  such  a  prefactor  on  a  vector  in  the 
ft-y  plane  may  be  obtained  as  follows :  Describe  an  ellipse  of  which 
ft  and  y  are  conjugate  semi-diameters.  Then  describe  a  similar  and 
similarly  placed  ellipse  of  which  the  vector  to  be  operated  on  is  a 
radius.  The  effect  of  the  operator  is  to  advance  the  radius  in  this 
ellipse,  in  the  angular  direction  from  ft  toward  y,  over  a  segment 
which  is  to  the  total  area  of  the  ellipse  as  q  is  to  2*\  When  used  as 
a  postf actor,  the  properties  of  the  dyadic  are  similar,  but  the  axis  of 
no  motion  and  the  planes  of  rotation  are  in  general  different. 

Def. — Such  dyadics  we  shall  call  cycbic. 

The  Nth  power  (N  being  any  whole  number)  of  such  a  dyadic  is 
obtained  by  multiplying  q  by  N.  If  q  is  of  the  form  2xN/M  (N  and 
M  being  any  whole  numbers)  the  Mth  power  of  the  dyadic  will  be  an 
idemfactor.    A  cyclic  dyadic,  therefore,  may  be  regarded  as  a  root  of 
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I,  or  at  least  capable  of  expression  with  any  required  degree  of 
accuracy  as  a  root  of  L 

It  should  be  observed  that  the  value  of  the  above  dyadic  will  not 
be  altered  by  the  substitution  for  a  of  any  other  parallel  vector,  or 
for  fi  and  y  of  any  other  conjugate  semi-diameters  (which  succeed  one 
another  in  the  same  angular  direction)  of  the  same  or  any  similar  and 
similarly  situated  ellipse,  with  the  changes  which  these  substitutions 
require  in  the  values  of  a',  ft,  y.  Or,  to  consider  the  same  changes 
from  another  point  of  view,  the  value  of  the  dyadic  will  not  be  altered 
by  the  substitution  for  a  of  any  other  parallel  vector  or  for  fj>  and  y 
of  any  other  conjugate  semi-diameters  (which  succeed  one  another  in 
the  same  angular  direction)  of  the  same  or  any  similar  and  similarly 
situated  ellipse,  with  the  changes  which  these  substitutions  require  in 
the  values  of  a,  /3,  and  y,  denned  as  reciprocals  of  a,  (3>,  y. 

148:*  The  strain  represented  by  the  equation 

p'=*{<rii+by+clck}.p 

where  a,  b,  c  are  positive  scalars,  may  be  described  as  consisting  of 
three  elongations  (or  contractions)  parallel  to  the  axes  i,  j,  jfe,  which 
are  called  the  principal  axes  of  the  strain,  and  which  have  the  pro- 
perty that  their  directions  are  not  affected  by  the  strain.  The  scalars 
a,  6,  e  are  called  the  principal  ratios  of  elongation.  (When  one  of 
these  is  less  than  unity,  it  •represents  a  contraction.)  The  order  of 
the  three  elongations  is  immaterial,  since  the  original  dyadic  is  equal 
to  the  product  of  the  three  dy adics 

oAi+jj+hk,        ii+bjj+lck,        ii+jj+ckk 

taken  in  any  order. 

Def — A  dyadic  which  is  reducible  to  this  form  we  shall  call  a  right 
tensor.  The  displacement  represented  by  a  right  tensor  is  called  a 
pwre  strain.    A  right  tensor  is  evidently  self -conjugate. 

149.  We  have  seen  (No.  135)  that  every  dyadic  may  be  expressed 
in  the  form 

±{ai'i+bj'j+cVk}, 

where  a,  bt  c  are  positive  scalars.    This  is  equivalent  to 

±{riV+bjy+ckfK}.{i'i+fj+Vk} 
and  to 

±{i'i+j'j+Vk}.{aii+bjj+ckk). 

Hence  every  dyadic  may  be  expressed  as  the  product  of  a  versor  and 
a  right  tensor  with  the  scalar  factor  ±1.  The  versor  may  precede  or 
follow.  It  will  be  the  same  versor  in  either  case,  and  the  ratios  of 
elongation  will  be  the  same;  but  the  position  of  the  principal  axes 
of  the  tensor  will  differ  in  the  two  cases,  either  system  being  derived 
from  the  other  by  multiplication  by  the  versor. 
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Def. — The  displacement  represented  by  the  equation 

is  called  inversion.  The  most  general  case  of  a  homogeneous  strain 
may  therefore  be  produced  by  a  pure  strain  and  a  rotation  with  or 
without  inversion. 

150.  If 

$=ai'i+y';+cfc'Jfe, 

and  $o«* '— af*i+W ljj+cHdc 

The  general  problem  of  the  determination  of  the  principal  ratios  and 
axes  of  strain  for  a  given  dyadic  may  thus  be  reduced  to  the  case  of 
a  right  tensor. 

151.  Def. — The  effect  of  a  prefactor  of  the  form 

acta  +  bfify  •+■  cyy, 
where  a,  6,  c  are  positive  or  negative  scalars,  a,  ft,  y  non-complanar 
vectors,  and  a,  ft,  y  their  reciprocals,  is  to  change  a  into  aa,  ft  into 
bfi,  and  y  into  cy.  As  a  postfactor,  the  same  dyadic  will  change  a 
into aa',fy  into  fyS*,  and  y  into  cy.  Dyadics  which  can  be  reduced  to 
this  form  we  shall  call  tonic  (Gr.  re/w).  The  right  tensor  already 
described  constitutes  a  particular  case,  distinguished  by  perpendicular 
axes  and  positive  values  of  the  coefficients  a,  6,  c. 

The  value  of  the  dyadic  is  evidently  not  affected  by  substituting 
vectors  of  different  lengths  but  the  same  or  opposite  directions  for 
a,  /8,  y,  with  the  necessary  changes  in  the  values  of  a',  ft,  y',  defined 
as  reciprocals  of  a,  /8,  y.  But,  except  this  change,  if  a,  6,  c  are 
unequal,  the  dyadic  can  be  expressed  only  in  one  way  in  the  above 
form.  If,  however,  two  of  these  coefficients  are  equal,  say  a  and  6, 
any  two  non-collinear  vectors  in  the  a-fi  plane  may  be  substituted 
for  a  and  /8,  or,  if  the  three  coefficients  are  equal,  any  three  non- 
complanar  vectors  may  be  substituted  for  a,  /8,  y. 

152.  Tonics  having  the  same  axes  (determined  by  the  directions 
of  a,  /8,  y)  are  homologous,  and  their  multiplication  is  effected  by 
multiplying  their  coefficients.    Thus, 

{o1aa/+61/3i8'+clyy'}.{a2aa/+6^i8'+C1yy'} 

-  {OyO^oia  +bJ)JiP +c1ctyy'}. 

Hence,  division  of  such  dyadics  is  effected  by  division  of  their  co- 
efficients. A  tonic  of  which  the  three  coefficients  a,  6,  c  are  unequal, 
is  homologous  only  with  such  dyadics  as  can  be  obtained  by  varying 
the  coefficients. 

153.  The  effect  of  a  prefactor  of  the  form 

aaa'  +  6  {Afr+yy'}  +  cfy/S'-jSy'}, 
or  aaa  +p cos g{#8'+yy/}  +pemq{yP-Py  }, 


Digitized  by  VjOOQIC 


70  VECTOR  ANALYSIS. 

where  a,  f¥t  y  are  the  reciprocals  of  a,  /8,  y,  and  a,  6,  c,  p,  and  q  are 
scalars,  of  which  p  is  positive,  will  be  most  evident  if  we  resolve  it 
into  the  factors 

aaa'+pP+yy', 

aa+pPP+pyy, 

aa'  +  <x*q\PP+yy'}  +  sing  {y^-^y'}, 

of  which  the  order  is  immaterial,  and  if  we  suppose  the  vector  on 
which  we  operate  to  be  resolved  into  two  factors,  one  parallel  to  a, 
and  the  other  in  the  /8-y  plana  The  effect  of  the  first  factor  is  to 
multiply  by  a  the  component  parallel  to  a,  without  affecting  the  other. 
The  effect  of  the  second  is  to  multiply  by  p  the  component  in  the 
/8-y  plane  without  affecting  the  other.  The  effect  of  the  third  is  to 
give  the  component  in  the  /8-y  plane  the  kind  of  elliptic  rotation 
described  in  No.  147. 

The  effect  of  the  same  dyadic  as  a  postfactor  is  of  the  same  nature. 

The  value  of  the  dyadic  is  not  affected  by  the  substitution  for  a  of 
another  vector  having  the  same  direction,  nor  by  the  substitution  for 
fi  and  y  of  two  other  conjugate  semi-diameters  of  the  same  or  a 
similar  and  similarly  situated  ellipse,  and  which  follow  one  another 
in  the  same  angular  direction. 

Def. — Such  dyadics  we  shall  call  cyclotonic. 

154.  Cyclotonics  which  are  reducible  to  the  same  form  except 
with  respect  to  the  values  of  a,  p,  and  q  are  homologous.  They  are 
multiplied  by  multiplying  the  values  of  a,  and  also  those  of  p,  and 
adding  those  of  q.    Thus,  the  product  of 

a^aa'  +  px  cos  ft^+yy'}  +p1sing1{y£'-/8y/} 

and  Ojaa'  +  pi<xxqi{PP+yy'}  +p1sing1{y/8'-/8y/} 

is  a^a'  +  p&t  cos  (?1+?1){0/8'+yy} 

+p1ptBm(q1+qJ{yP-py). 

A  dyadic  of  this  form,  in  which  the  value  of  q  is  not  zero,  or  the 
product  of  x  and  a  positive  or  negative  integer,  is  homologous  only 
with  such  dyadics  as  are  obtained  by  varying  the  values  of  a,  p,  and  q. 

155.  In  general,  any  dyadic  may  be  reduced  to  the  form  either  of  a 
tonic  or  of  a  cyclotonic  (The  exceptions  are  such  as  are  made  by 
the  limiting  cases.)  We  may  show  this,  and  also  indicate  how  the 
reduction  may  be  made,  as  follows.  Let  $  be  any  dyadic.  We  have 
first  to  show  that  there  is  at  least  one  direction  of  p  for  which 

$.p  =*<*/>. 
This  equation,  is  equivalent  to 

or,  {$-aI}.psO. 
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That  is,  *— ol  is  a  planar  dyadic,  which  may  be  expressed  by  the 
option  |$-ol|-*0. 

(See  No.  140.)    Let  #.Xi+M;+,Jb; 

the  equation  becomes 

|[X-a*li+[/i-<0li+[v-ai]*|  -0, 
or,  [X-at]x[/i-a;l.[K-ai]«0, 

or,   a?—(i.\+j.p+k.y)a*+(i.px¥+j.¥x\+k.\xp)a-\xii.¥=0. 
This  may  be  written 

o?-*fcrf+{*-»}s|*|a-|*|.a* 
Now  if  the  dyadic  #  is  given  in  any  form,  the  scalars 

are  easily  determined.  We  have  therefore  a  cubic  equation  in  a,  for 
which  we  can  find  at  least  one  and  perhaps  three  roots.  That  is,  we 
can  find  at  least  one  value  of  a,  and  perhaps  three,  which  will  satisfy 
the  equation  |$-al|«0. 

By  substitution  of  such  a  value,  #— ol  becomes  a  planar  dyadic,  the 
planes  of  which  may  be  easily  determined  t  Let  a  be  a  vector 
normal  to  the  plane  of  the  consequents.    Then 

{#-aI}.a-0, 
$.a=aa. 
If  $  is  a  tonic,  we  may  obtain  three  equations  of  this  kind,  say- 
$.a=<H*9        #.£  =»  b/3f        $.y  »  ey, 
in  which  a,  /8,  y  are  not  oomplanar.    Hence  (by  No.  108X 

where  a',  /3",  y  are  the  reciprocals  of  a,  £,  y. 

In  any  case,  we  may  suppose  a  to  have  the  same  sign  as  |#|,  since 
the  cubic  equation  must  have  such  a  root  Let  a  (as  before)  be 
normal  to  the  plane  of  the  consequents  of  the  planar  $— cH,  and  a 
normal  to  the  plane  of  the  antecedents,  the  lengths  of  a  and  a  being 
such  that  a .  a' =14  Let  /3  be  any  vector  normal  to  a',  and  such  that 
$./8  is  not  parallel  to  /3.  (The  case  in  which  $.0  is  always  parallel 
to  0,  if  0  is  perpendicular  to  a,  is  evidently  that  of  a  tonic,  and  needs 
no  farther  discussion.)  {#— al}.0  and  therefore  $./8  will  be  per- 
pendicular to  a'.    The  same  will  be  true  of  &./3.    Now  (by  No.  140) 

[*o].[«».jB]x[*.ffl-|*|«.[*fflxA 
that  is,  M.[&.fflx[*.fflH*l«-[*-fflx£ 

•  [See  note  on  p.  90.] 

tin  particular  oaeee,  *-oI  may  reduce  to  a  linear  dyadic,  or  to  aero.    Theae,  how- 
erer,  will  preaent  no  difficulties  to  the  atndent. 
X  For  the  oaae  in  which  the  two  planet  are  perpendicular  to  each  other,  aee  No.  157. 
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Hence,  since  [^.flxfS./S]    and    [$.ftx)8  are  parallel, 

a[&.0lx[*./3]=\*\[$.l3]xp. 
Since  a*1^!  is  positive,  we  may  set 

If  we  also  set 

fix-p-1*./*.        ft^-'^.ft    etc, 
i8M-p*-l.A       ftf=p«*-2.ft    etc, 
the  vectors  ft  ft,  ft,  etc,  j8_lf  #.2,  etc,  will  all  lie  in  the  plane 
perpendicular  to  a',  and  we  shall  have 

ftx&^ftxft 
[fc+fflxft-O. 
We  may  therefore  set  ft+£= 2nft. 

Multiplying  by  j*"1^,  and  by  pi'1, 

ft+ft  =  27ift,        ft+ft=2nft,    etc, 
ft+0M  =  2nft        0+ft2= 2n^.lf    etc 
Now,  if  7i  >  1,  and  we  lay  off  from  a  common  origin  the  vectors 

A  &>  &»  e*0-*  0-i»  £_f,  etc, 
the  broken  line  joining  the  termini  of  these  vectors  will  be  convex 
toward  the  origin.  All  these  vectors  must  therefore  lie  between  two 
limiting  lines,  which  may  be  drawn  from  the  origin,  and  which  may 
be  described  as  having  the  directions  of  ft  and  ft„*  A  vector 
having  either  of  these  directions  is  unaffected  in  direction  by 
multiplication  by  $.  In  this  case,  therefore,  $  is  a  tonic  If  n  <  —  1 
we  may  obtain  the  same  result  by  considering  the  vectors 

ft  — &,  ft,  —ft,  ft,  etc,  —  £M,  j8_,,  — /S-s,  ©tc, 
except  that  a  vector  in  the  limiting  directions  will  be  reversed  in 
direction    by    multiplication    by    $,    which   implies   that   the   two 
corresponding  coefficients  of  the  tonic  are  negative. 
If  1  >  n  >  —  l,t  we  may  set 

7i=  cos  q. 

Then  j8.1+ft=2cosgft 

Let  us  now  determine  y  by  the  equation 

ft  =  cosg0+singy. 
This  gives  £.x = cos q/3— sing y. 

Now  a  is  one  of  the  reciprocals  of  a,  ft  and  y.    Let  ft  and  y  be  the 
others.    If  we  set 

*=cos^{i8ft+yy/}+sin9{yft-/3y'}, 

we  have  ¥.a  =  0,        ¥.£=ft,        ^.ft^ft 


*  The  termini  of  the  veotors  will  in  fact  lie  on  a  hyperbola, 
tFor  the  limiting  oases,  in  which  »=1,  or  »»  - 1,  tee  No.  166. 
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Therefore,  since  {aaa'+p^.a  =oa=io, 

{aaa'+pVyp^p&^Q.p, 

{a«t+p9}.fi.1-pfi-*.fi.v 
it  follows  (by  No.  108)  that 

$  =  aaa'+p<fr=aaa  +  pcoaqiPff+yy'}  +  J>sing{y|8'-/8y'}. 

156.  It  will  be  sufficient  to  indicate  (without  demonstration)  the 
forms  of  dyadics  which  belong  to  the  particular  cases  which  have 
been  passed  over  in  the  preceding  paragraph,  so  far  as  they  present 
any  notable  peculiarities. 

If  n=  ±1  (page  72),  the  dyadic  may  be  reduced  to  the  form 

aaa'+b{PP+yy'}  +bc$y' , 

where  a,  0,  y  are  three  non-complanar  vectors,  a',  /S',  y  their  reci- 
procals, and  a,  b,  c  positive  or  negative  scalars.  The  effect  of  this  as  an 
operator,  will  be  evident  if  we  resolve  it  into  the  three  homologous 
factors 

aaa  +/3f¥ +yy', 

aa'+btfP +yy'}t 
oa'+)8/8'+yy'+c0y/. 

The  displacement  due  to  the  last  factor  may  be  called  a  simple  shear. 
It  consists  (when  the  dyadic  is  used  as  prefactor)  of  a  motion  parallel 
to  fi,  and  proportioned  to  the  distance  from  the  a-/8  plane.  This 
factor  may  be  called  a  shearer. 

This  dyadic  is  homologous  with  such  as  are  obtained  by  varying 
the  values  of  a,  6,  c,  and  only  with  such,  when  the  values  of  a  and  6 
are  different,  and  that  of  c  other  than  zero. 

157.  If  the  planar  $— al  (page  71)  has  perpendicular  planes,  there 
may  be  another  value  of  a,  of  the  same  sign  as  |  $  |,  which  will  give  a 
planar  which  has  not  perpendicular  planes.  .When  this  is  not  the 
case,  the  dyadic  may  always  be  reduced  to  the  form 

a\aa'+PP+yy'}+ab{aP+Py'}+aeay', 

where  a,  ft,  y  are  three  non-complanar  vectors,  a',  /ff,  y',  their  reci- 
procals, and  a,  6,  c,  positive  or  negative  scalars.  This  may  be  resolved 
into  the  homologous  factors 

al  and  I+b{aP+/3y  }+cay'. 

The  displacement  due  to  the  last  factor  may  be  called  a  complex  shear. 
It  consists  (when  the  dyadic  is  used  as  prefactor)  of  a  motion  parallel 
to  a  which  is  proportional  to  the  distance  from  the  a-y  plane,  together 
with  a  motion  parallel  to  b/3+ ca  which  is  proportional  to  the  distance 
from  the  a-f$  plane.    This  factor  may  be  called  a  complex  shearer 
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This  dyadic  is  homologous  with  such  as  are  obtained  by  varying 
the  values  of  a,  b,  c,  and  only  such,  unless  6»0. 

It  is  always  possible  to  take  three  mutually  perpendicular  vectors 
for  a,  ft,  and  y ;  or,  if  it  be  preferred,  to  take  such  values  for  these 
vectors  as  shall  make  the  term  containing  c  vanish. 

158.  The  dyadics  described  in  the  two  last  paragraphs  may  be 
called  shearing  dyadics. 

The  criterion  of  a  shearer  is 

{$-I}»=0,        #-1*0. 

The  criterion  of  a  simple  shearer  is 

{$-I}*=0,        $-1+0. 
The  criterion  of  a  complex  shearer  is 

{$-I}»»0,        {$-I}**0. 

Nor*.— If  a  dyadio  #  is  a  linear  Amotion  of  a  vector  p  (the  term  linear  being  vied  in 
the  same  eenee  at  in  No.  106),  we  may  represent  the  relation  by  an  equation  of  the  form 

*=a07.p+«i>p+ete., 

or  *=r{a0y+«fr+eto.}.p, 

where  the  expression  in  the  braces  may  be  called  a  triadic  polynomial,  and  a  single  term 
a07  a  triads  or  the  indeterminate  product  of  the  three  vectors  a,  0,  7.  We  are  thus  led 
suooestively  to  the  consideration  of  higher  orders  of  indeterminate  products  of  vector*, 
triad*,  tetrads,  etc,  in  general  potyads,  and  of  polynomials  consisting  of  such  terms, 
triadics,  tetradici,  etc,  in  general  polyadk*.  But  the  development  of  the  subject  in 
this  direction  lies  beyond  our  present  purpose. 
It  may  sometimes  be  convenient  to  use  notations  like 

to  represent  the  conjugate  dyadics  which,  the  first  ss  prefactor,  and  the  seoond  ss 
postfaotor,  change  a,  fi,  7  into  X,  n>  w%  respectively.  In  the  notations  of  the  preceding 
chapter  these  would  be  written 

/Xa'+fltf+r/  and  a-'X+fn+yr 

respectively,  a',  ft,  y  denoting  the  reciprocals  of  «,  0,  7.  If  r  is  a  linear  function 
of  p,  the  dyadics  which  ss  prefactor  and  postfaotor  change  p  into  r  may  be  written 
respectively 

^  snd  -^. 

\P  P{ 

If  r  is  any  function  of  p,  the  dyadics  which  as  prefactor  and  postfsotor  ohsnge  dp  into 
dr  may  be  written  respectively 

dr        .   dr 

In  the  notation  of  the  following  chapter  the  seoond  of  these  (when  p  denotes  a  posi- 
tion-vector) would  be  written  Vr.    The  trisdio  which  as  prefactor  changes  dp  into 

-i  may  be  written  -SL  and  that  which  as  postfaotor  changes  dp  into  £L  may  be 

\dp  \dfr  dp\ 

written  f^p.    The  latter  would  be  written  Wt  in  the  notations  of  the  following  chapter. 
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CHAPTER  IV. 

(SUPWJattWTABT  10  Oharb  n.) 

CONCERNING  THE  DIFFERENTIAL  AND  INTEGRAL  CALCULUS  OF  VECTORS. 

159.  If  *  is  a  vector  having  continuously  varying  values  in  space, 
and  p  the  vector  determining  the  position  of  a  point,  we  may  set 

dp**dxi+dyj+d*k, 
and  regard  *  as  a  function  of  p,  or  of  x,  y,  and  z. 


Then, 


that  is, 


If  we  set 


Here  V  stands  for 


*«        -doe  ,  -dw  .  7  duo 
d»s»dp.V«. 


exactly  as  in  No.  52,  except  that  it  is  here  applied  to  a  vector  and 
produces  a  dyadic,  while  in  the  former  case  it  was  applied  to  a 
scalar  and  produced  a  vector.  The  dyadic  V«  represents  the  nine 
differential  coefficients  of  the  three  components  of  w  with  respect  to 
x9  y,  and  *,  just  as  the  vector  Vu  (where  u  is  a  scalar  function  of  p) 
represents  the  three  differential  coefficients  of  the  scalar  u  with 
respect  to  x,  y,  and  s. 

It  is  evident  that  the  expressions  V.«  and  Vx«  already  denned 
(No.  54)  are  equivalent  to  {V«}8  and  {V*}x. 

160.  An  important  case  is  that  in  which  the  vector  operated  on  is 
of  the  form  Via.    We  have  then 


where 


Wu«. 


<Pu 


dVu~dp.VVu, 

cPu    ..        cPu 


ik 


.    <Pu    ..    .     dhi  ..       '   cPu   ., 

+3^*  +  dp*    +  dfr&k 


<Pu 


<Pu 


dhi 


This  dyadic,  which  is  evidently  self-conjugate,  represents  the  si 
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differential  coefficients  of  the  second  order  of  u  with  respect  to  x,  y, 
and  z* 

161.  The  operators  Vx  and  V.  may  be  applied  to  dyadics  in  a 
manner  entirely  analogous  to  their  use  with  scalars.  Thus  we  may 
define  Vx$  and  V.$  by  the  equations 

Then,  if  *=ai+£;+yJfe, 

Vx$=Vxai+Vx#;+Vxyfc 
V.$  «  V.  ai  +  V.  #?  +  V.y  fc. 
Or,  if  ***ia+jfj+hy, 

cue    ay    az 

162.  We  may  now  regard  V.V  in  expressions  like  V.Va>  as  repre- 
senting two  successive  operations,  the  result  of  which  will  be 

in  accordance  with  the  definition  of  No.  70.  We  may  also  write 
V.V$  for 

although  in  this  case  we  cannot  regard  V.V  as  representing  two 
successive  operations  until  we  have  defined  V$.t 

That  V.V$=VV.$— VxVx$  will  be  evident  if  we  suppose  $  to 
be  expressed  in  the  form  ai+/3j+yk    (See  No.  71.) 

163.  We  have  already  seen  that 

u"-tt'=/dp.Vu, 

where  vl  and  u"  denote  the  values  of  u  at  the  beginning  and  the  end 
of  the  line  to  which  the  integral  relates.  The  same  relation  will  hold 
for  a  vector ;  i.e., 

&" — to  ^jdp .  Vo>. 

*  We  might  proceed  to  higher  steps  in  differentiation  by  means  of  the  triadios  W«, 
VWic,  the  tetradios  VW«,  ?WVu,  etc.  See  note  on  page  74.  In  like  manner  a  dyadic 
function  of  position  in  space  ($)  might  be  differentiated  by  means  of  the  triadio  V*,  the 
tetradio  W*,  etc. 

tSee  footnote  to  No.  160. 
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164  The  following  equations  between  surface-integrals  far  a  closed 
surface  and  volume-integrals  for  the  space  enclosed  seem  worthy  of 
mention.  One  or  two  have  already  been  given,  and  are  here  repeated 
for  the  sake  of  comparison. 

ffcUru-fffdvVu,  (1) 

Xfd*»=fffdvV»,  (2) 

ff&r.o-fffdvV.*,  (3) 

ffd*.*=fffdvV.*t  (4) 

ffdvxw^fJTdvVx*  (5) 

£td<r**=fffdvVxi.  (6) 

It  may  aid  the  memory  of  the  student  to  observe  that  the  transfor- 
mation may  be  effected  in  each  case  by  substituting  fffdv  V  for  ffdv. 

165.  The  following  equations  between  line-integrals  for  a  closed 
line  and  surface-integrals  for  any  surface  bounded  by  the  line,  may 
also  be  mentioned.  (One  of  these  has  already  been  given.  See 
No.  60.) 

fdpu^JfdffxVu,  (1) 

fdpw-ffdvxV*,  (2) 

fdp.w~ffdr.Vxw,  (3) 

/dp.$=j^*r.Vx$,  (4) 

fdp*w=ffV*.d<r-ffd<rV.w.  (5) 

These  transformations  may  be  effected  by  substituting  Jf[d*  x  V]  for 
/dp.  The  brackets  are  here  introduced  to  indicate  that  the  multi- 
plication of  dor  with  the  i,j,  k  implied  in  V  is  to  be  performed  before 
any  other  multiplication  which  may  be  required  by  a  subsequent  sign. 
(This  notation  is  not  recommended  for  ordinary  use,  but  only  sug- 
gested as  a  mnemonic  artifice.) 

166.  To  the  equations  in  No.  65  may  be  added  many  others,  as, 

V[ua>]  =  Vtt«+uV«,  (1) 

V[tx«0  =  VtX»-V«xt,  (2) 

Vx[tx«]=«.Vt-V.w-t.V«+V.«>t,  (3) 

V(T.«)=VT.a>+V».T,  (4) 

V.{t«}  -  V.t«+t.V«,  (5) 

Vx{t«}  =  Vxt«-txV»,  (6) 

V.{u*}  =  Vtt.$+uV.$,  (7) 

etc 

The  principle  in  all  these  cases  is  that  if  we  have  one  of  the  operators 
V,  V.,  Vx  prefixed  to  a  product  of  any  kind,  and  we  make  any 
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transformation  of  the  expression  which  would  be  allowable  if  the  V 
were  a  vector  (viz.,  by  changes  in  the  order  of  the  factors,  in  the 
signs  of  multiplication,  in  the  parentheses  written  or  implied,  etc),  by 
which  changes  the  V  is  brought  into  connection  with  one  particular 
factor,  the  expression  thus  transformed  will  represent  the  part  of 
the  value  of  the  original  expression  which  results  from  the  variation 
of  that  factor. 

167.  From  the  relations  indicated  in  the  last  four  paragraphs,  may 
be  obtained  directly  a  great  number  of  transformations  -of  definite 
integrals  similar  to  those  given  in  Nos.  74-77,  and  corresponding  to 
those  known  in  the  scalar  calculus  by  the  name  of  integration  by 
parts. 

168.  The  student  will  now  find  no  difficulty  in  generalizing  the 
integrations  of  differential  equations  given  in  Nos.  78-89  by  applying 
to  vectors  those  which  relate  to  scalars,  and  to  dyadics  those  which 
relate  to  vectors. 

169.  The  propositions  in  No.  90  relating  to  minimum  values  of  the 
volume-integral  fffiua.wdv  may  be  generalized  by  substituting 
*.$.*  for  uo>.«,  $  being  a  given  dyadic  function  of  position  in  space. 

170.  The  theory  of  thto  integrals  wnich  have  been  called  potentials, 
Newtonians,  etc  (see  Nos.  91-102)  may  be  extended  to  cases  in  which 
the  operand  is  a  vector  instead  of  a  scalar  or  a  dyadic  instead  of  a 
vector.  So  far  as  the  demonstrations  are  concerned,  the  case  of  a 
vector  may  be  reduced  to  that  of  a  scalar  by  considering  separately 
its  three  components,  and  the  case  of  a  dyadic  may  be  reduced  to  that 
of  a  vector,  by  supposing  the  dyadic  expressed  in  the  form  <pi+xj+«k 
and  considering  each  of  these  terms  separately. 


CHAPTER  V. 
CONCERNING  TRANSCENDENTAL  FUNCTIONS  OF  DYADICS. 

171.  Def. — The  exponential  function,  the  sine  and  the  cosine  of  a 
dyadic  may  be  defined  by  infinite  series,  exactly  as  the  corresponding 
functions  in  scalar  analysis,  viz., 

e'-I+S+i^+iMP+etc., 
smS-S-iH^+ixo^-etc., 
eoeS-I-i^+rh^-etc. 
These  series  are  always  convergent    For  every  value  of  $  there  is 
one  and  only  one  value  of  each  of  these  functions.    The  exponential 
function  may  also  be  defined  as  the  limit  of  the  expression 

(■4)" 
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when  N,  which  is  a  whole  number,  is  increased  indefinitely.    That 
this  definition  is  equivalent  to  the  preceding,  will  appear  if  the 
expression  is  expanded  by  the  binomial  theorem,  which  is  evidently 
applicable  in  a  case  of  this  kind. 
These  functions  of  *  are  homologous  with  $. 

172.  We  may  define  the  logarithm  as  the  function  which  is  the 
inverse  of  the  exponential,  so  that  the  equations 

«♦-#, 

¥«log$, 
are   equivalent,  leaving  it  undetermined  for  the  present  whether 
every  dyadic  has  a  logarithm,  and  whether  a  dyadic  can  have  more 
than  one. 

173.  It  follows  at  once  from  the  second  definition  of  the  exponential 
function  that,  if  $  and  ¥  are  homologous, 

and  that,  if  T  is  a  positive  or  negative  whole  number, 

174.  If  S  and  *  are  homologous  dyadics,  and  such  that 

the  definitions  of  No.  171  give  immediately 
«*•♦■•  ooe$+K  sin*, 

6-*'+«oos$— Hsin$, 
whence 

coe*«  *{«"•+«-"•}, 
sin$«-iS{«l,#-«-,,#}. 

175.  If  $.*«¥.#«0, 

{#+¥}*=#*+¥•        {#+*}•-*»+*•,   etc 
Therefore  *#+*««#+«*-I, 

coe{$+¥}«coe#+cos¥-I, 
sin{$+¥}  -sin$+sin¥. 

|e#|«A 
For  the  first  member  of  this  equation  is  the  limit  of 

KI+N-1*}*!,    that  is,  of    |I+N-l#|*. 
If  we  set  $«ai+£;+yJfc,  the  limit  becomes  that  of 

(l+N-^a.i+N-^.i+N^y.Jb)11,    or    (1+N-**,)" 
the  limit  of  which  is  the  second  member  of  the  equation  to  be  proved. 
177.  By  the  definition  of  exponentials,  the  expression 

represents  the  limit  of 
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Now  I+gN-1^—  jk)  evidently  represents  a  versor  having  the  axis  i 
and  the  infinitesimal  angle  of  version  qS'\  Hence  the  above  ex- 
ponential represents  a  versor  having  the  same  axis  and  the  angle  of 
version  q.    If  we  set  gi= a>,  the  exponential  may  be  written 

Such  an  expression  therefore  represents  a  versor.  The  axis  and 
direction  of  rotation  are  determined  by  the  direction  of  »,  and  the 
angle  of  rotation  is  equal  to  the  magnitude  of  «.  The  value  of  the 
versor  will  not  be  affected  by  increasing  or  diminishing  the  magnitude 
of  «  by  2*\ 

178.  If,  as  in  No.  151, 

*=oaa'+6$8'+cyy', 
the  definitions  of  No.  171  give 

cos$«cosaaa'  +  coabf}f¥  +  coscyy', 
sin  §>= sin  aaa'  +  sin&$8/  +  sincyy'. 
If  a,  b,  c  are  positive  and  unequal,  we  may  add,  by  No.  172, 
log$>=logaact'  +  logbfiP  +  logcyy'. 

179.  If,  as  in  No.  153, 
*~aaa'+b{pp+yy')+c{yP-Py'} 

=*aaa'+p  cosg{^+yy'}+  p  sin  q{yf¥-fiy}, 
we  have  by  No.  173     ^_^  #  <#&+*} . <£*-*). 

But  e-Waa'+flg'+yy', 

e^H^-aa'+^jS/S'+yy'}, 

<f{*-*A=aa'  +  coscfAS'+yy'}  +  sinc{y/8'-/8y  }. 

Therefore,     fi^*^  +  e'coecfjS/S'+yy'}  +  e»sinc{yj8'-j8y/}. 

Hence,  if  a  is  positive, 

log*=tlogaaa/-|-logjp{i8i8'-|-yy/}-|-g{yi8'-)8y/}. 

Since  the  value  of  $  is  not  affected  by  increasing  or  dimininhing  q  by 
2x,  the  function  log  #  is  many- valued. 
To  find  the  value  of  cos  $  and  sin  $,  let  us  set 

0  =  6{i8^+yy'}+c{yi8'-i8y'}, 

Then,  by  No,  175, 

cos  # = cos  {aaa'}  +coe  0 — I 

But  cos  {aaa'}  - 1 = cos  aaa'  -  aa'. 

Therefore,  coe#=cosaaa'-aa'+co8  0. 
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Now,  by  No.  174, 

coee=J{«"e+«"ie}- 

Since  *         K.e=-c{i8i8'+yy'}+&{y0'-i8y'}, 

e**  =  aa'  +  e-coab  {flS'+yy'}  +  e-«sin6  {y^-jSy'}, 

e-M  =  aa  +eccos6{#8'+yy}  -  e'sinMy^'-jgy  }. 
Therefore 

coee  =  aa'  +  i(«fl+«")cos6{ft8'+yy'}  -  K*-«-«)smfc{y8'-£y'}, 

and 

co8#=coeaaa'+  *(«•+«"')«» 6  {^H-yy}  -  K^-O^^yiS'-iSy'}. 

In  like  manner  we  find 

sm4>=sin  aaa!  +  i(e°+e-)sinb{PP+Yy'}  +  i(#-e-c)cQ*b  {y^-fly  }. 

180.  If  a,  ft  y  and  «',  ^,  y'  are  reciprocals,  and 

#=aaa'+6{j8£'+yy'}+ci8y', 
and  N  is  any  whole  number, 

*»  =  aW+^{ftff+yy'}+N^->C)8y'. 
Therefore,  ^eW+e^jS/S'+yy'H^c/Sy', 

cos  # = cos  a  aa'+cos  6  {jfyS'+yy }  —  c  sin  6  j8y , 
sin  #=sin  a  aa'+sin  6  {iS/S'+yy'}  +c  cos  b  fiy\ 
If  a  and  b  are  unequal,  and  c  other  than  zero,  we  may  add 
log^^logaaa'+logb^^+yy'J+ct-^y. 

181.  If  a,  £,  y,  and  a',  #,  y  are  reciprocals,  and 

#  =  al  +  6{a£'+j8y'}+cay', 
and  N  is  a  whole  number, 

^-a^I+Na^-^faiff+iSy'J+CNa^c+JNCN-^a^^ay'. 

Therefore 

e*  =  e*I+*b{aP+Py'}+e*(lb>+c)ay, 
cos#=coeaI  — bsinafo^+^a'J—CJ^coea+csin^ay', 
sin  # = sin  a  I + b  cos  a  {aft + #a'}  —  ( J^sin  a — c  cos  a)ay'. 
Unless  6=0,  we  may  add 

log#=logaI  +  6a-1{ai8'+i8a,}+(ca-1-i6^-2)ay'.- 

182.  If  we  suppose  any  dyadic  $  to  vary,  but  with  the  limitation 
that  all  its  values  are  homologous,  we  may  obtain  from  the  definitions 
of  No.  171  7r  ♦,      +i^  /n 

tfsin$  =  coe$.<i#,  (2) 

dcos$=  —  sin$>.cH^,  (3) 

dlog^*-1.^,  (4) 
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as  in  the  ordinary  calculus,  but  we  roust  not  apply  these  equations  to 
cases  in  which  the  values  of  $  are  not  homologous. 

188.  Ify  however,  T  is  any  constant  dyadic,  the  variations  of  tT  will 
necessarily  be  homologous  with  tT,  and  we  may  write  without  other 
limitation  than  that  T  is  constant, 


3JL*- 

(i) 

^P-r.co8{<r}, 

(2) 

<*^P--.r.8in{*r}, 

(3) 

dlog{tT)    I 

dt     "r 

(4) 

A  second  differentiation  gives 

*gw. 

(5) 

^Il.-P.sinm 

(6) 

*2«jP--i.ao.<«r>. 

(7) 

184.  It  follows  that  if  we  have  a  differential  equation 

of  the  form 

*-«■* 

the  integral  equation  will  be  of  the  form 

P=e  .p, 

p  representing  .the  value  of  p  for  t=0.    For  this  gives 

J-rVV-r.* 

and  the  proper  value  of  p  for  2=0. 

185.  Def. — A  flux  which  is  a  linear  function  of  the  position- vector 
is  called  a  hoTnogeneovs-strain-fiux  from  the  nature  of  the  strain 
which  it  produces.  Such  a  flux  may  evidently  be  represented  by  a 
dyadic. 

In  the  equations  of  the  last  paragraph,  we  may  suppose  p  to 
represent  a  position- vector,  t  the  time,  and  T  a  homogeneous-strain- 
flux.  Then  etT  will  represent  the  strain  produced  by  the  flux  T  in 
the  time  t. 

In  like  manner,  if  A  represents  a  homogeneous  strain,  {logA}/6 
will  represent  a  homogeneous-strain-flux  which  would  produce  the 
strain  A  in  the  time  t 
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186.  If  we  have  - 

where  T  is  complete,  the  integral  equation  will  be  of  the  form 

p-/p.a  +  «-|p.£ 
For  this  gives  , 

J-r.^.a-r.«-,r.A 

and  a  and  j3  may  be  determined  so  as  to  satisfy  the  equations 

Bel  -*•«-*• 


187.  The  differential  equation 
will  be  satisfied  by 


S--IV 


p-eoe{er}.a  +  sm{tT}./3, 
whence  , 

J-  -r.sin{tr}.a  +  r.cos{<r}./8, 

^--H.oos^rj.a  -  P.sin{tr}.j3--P./>. 

If  r  is  complete,  the  constants  a  and  0  may  be  determined  to  satisfy 
the  equations 


HeL-«* 


where  P— A*  is  a  complete  dyadic,  and 

r.A=A.r-o, 

we  may  set 

,=  {|«*+ J«-|r+  cos{*A}  -1} .«+ {  J^-K^+sin  {*A}  }.£, 
which  gives 

g£«  {jr.e'r-ir.e-,r-A.sin{<A}  }.a 

+  {jr.eff+jr.«-<r+A.cos{tA}}.i8, 
^,  {jr».e"-+ JP.«-'r-A* .cos{«A} }.« 

+  {jP.«<r-JP.«-,r-At.8in{tA}  }.£ 
-{P-A«>.p. 
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The  constants  a  and  0  are  to  be  determined  by 

r* 


{r+A}.£ 

189.  It  will  appear,  on  reference  to  Nos.  155-157,  that  every 
complete  dyadic  may  be  expressed  in  one  of  three  forms,  viz.,  as  a 
square,  as  a  square  with  the  negative  sign,  or  as  a  difference  of  squares 
of  two  dyadics  of  which  both  the  direct  products  are  equal  to  sera 
It  follows  that  every  equation  of  the  form 

**  .6* 


3- 


where  6  is  any  constant  and  complete  dyadic,  may  be  integrated  by 
the  preceding  formula. 


NOTE  ON  BIVECTOR  ANALYSIS.* 

1.  A  vector  is  determined  by  three  algebraic  quantities.  It  often 
occurs  that  the  solution  of  the  equations  by  which  these  are  to  be 
determined  gives  imaginary  values,  i.e.,  instead  of  scalars  we  obtain 
biscalars,  or  expressions  of  the  form  a+ib,  where  a  and  b  are  scalars, 
and  i  —  */— 1.  It  is  most  simple,  and  always  allowable,  to  consider 
the  vector  as  determined  by  its  components  parallel  to  a  normal 
system  of  axes.    In  other  words,  a  vector  may  be  represented  in  .the 

form  m+nf+A 

Now  if  the  vector  is  required  to  satisfy  certain  conditions,  the  solution 
of  the  equations  which  determine  the  values  of  x,  yt  and  z,  in  the 
most  general  case,  will  give  results  of  the  form 

ft-^  +  t*!, 

*  Thus  far,  in  accordance  with  the  purpose  expressed  in  the  footnote  on  page  17,  we 
have  considered  only  real  values  of  scalars  and  vectors.  The  objeot  of  this  limitation 
has  been  to  present  the  subject  in  the  most  elementary  manner.  The  limitation  is 
however  often  inconvenient,  and  does  not  allow  the  most  symmetrical  and  complete 
development  of  the  subject  in  many  important  directions.  Thus  in  Chapter  V,  and  the 
latter  part  of  Chapter  HI,  the  exclusion  of  imaginary  values  has  involved  a  considerable 
sacrifioe  of  simplicity  both  in  the  enunciation  of  theorems  and  in  their  demonstration. 
The  student  will  find  an  interesting  and  profitable  exercise  in  working  over  this  part  of 
the  subjeot  with  the  aid  of  imaginary  values,  especially  in  the  discussion  of  the 
imaginary  roots  of  the  cubic  equation  on  page  71,  and  in  the  use  of  the  formula 

e'*=oos*+4sin* 
in  developing  the  properties  of  the  tines,  cosines,  and  exponentials  of  dyadics. 
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where  xlf  x2,  ylt  yt,  zlt  zs  are  scalars.    Substituting  these  values  in 

we  obtain  fo+^«)*+(yi+<y«)j +(*i+<sf)* ; 

or,  if  we  set  Pi^Xii+yJ+zJc, 

Ptma*ti+VJ+*& 
we  obtain  /h+«Pr 

We  shall  call  this  a  bivector,  a  term  which  will  include  a  vector  as  a 
particular  case.  When  we  wish  to  express  a  bivector  by  a  single 
letter,  we  shall  use  the  small  German  letters.    Thus  we  may  write 

An  important  case  is  that  in  which  p1  and  p^  have  the  same  direction. 
The  bivector  may  then  be  expressed  in  the  form  (a+ib)p,  in  which 
the  vector  factor,  if  we  choose,  may  be  a  unit  vector.  In  this  case,  we 
may  say  that  the  bivector  has  a  real  direction.  In  fact,  if  we  express 
the  bivector  in  the  form 

(aj1+iajf)i-h(y1+iyj)i+(»1+i«2)* 
the  ratios  of  the  coefficients  of  i,  j,  and  i,  which  determine  the  direc- 
tion cosines  of  the  vector,  will  in  this  case  be  real 

2.  The  consideration  that  operations  upon  bivectors  may  be  regarded 
as  operations  upon  their  biscalar  x-y  y-  and  z-components  is  sufficient 
to  show  the  possibility  of  a  bivector  analysis  and  to  indicate  what  its 
rules  must  be.  But  this  point  of  view  does  not  afford  the  most  simple 
conception  of  the  operations  which  we  have  to  perform  upon  bivectors. 
It  is  desirable  that  the  definitions  of  the  fundamental  operations  should 
be  independent  of  such  extraneous  considerations  as  any  system  of 
axes. 

The  various  signs  of  our  analysis,  when  applied  to  bivectors,  may 
therefore  be  defined  as  follows,  viz., 

The  bivector  equation 

/    ,        f  ft    ,        tr 

fl  +4V  =/i    +IV 

implies  the  two  vector  equations 

p.  =/i  ,    ana    v  =y  . 
— [m+u']!=  — /*+«[—"]• 

[pl  +  &]  •  [m" + »'"] = [m'«  m" — v.  i>"]  4- 1  [m'.  y" + y'*n "\ 

♦  (a + 46)  \ji + w] = a/i  -  hw + i[a* + b/i\. 
[p+u>](a+tb)=Ha-9b+i[Mb+pa]. 
Therefore  the  position  of  the  scalar  factor  is  indifferent.    [MS.  note  by  author.] 
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With  these  definitions,  a  great  part  of  the  laws  of  vector  analysis 
may  be  applied  at  once  to  bivector  expressions.  But  an  equation 
which  is  impossible  in  vector  analysis  may  be  possible  in  bivector 
analysis,  and  in  general  the  number  of  roots  of  an  equation,  or  of  the 
values  of  a  function,  will  be  different  according  as  we  recognise, 
or  do  not  recognize,  imaginary  values. 

8.  Def — Two  bivectors,  or  two  biscalars,  are  said  to  be  conjugate, 
when  their  real  parts  are  the  same,  and  their  imaginary  parts  differ 
in  sign,  and  in  sign  only. 

Hence,  the  product  of  the  conjugates  of  any  number  of  bivectors 
and  biscalars  is  the  conjugate  of  the  product  of  the  bivectors  and 
biscalars.    This  is  true  of  any  kind  of  product 

The  products  of  a  vector  and  its  conjugate  are  as  follows : 

[m+«|'].[m— w]»fl.fl+¥.V 

Hence,  if  fi  and  tv  represent  the  real  and  imaginary  parts  of  a 
bivector,  the  values  of 

/Ji.fl+v.p,        fixv;        mm  +  w,         17*— pur, 

are  not  affected  by  multiplying  the  bivector  by  a  biscalar  of  the  form 
a+i&,  in  which  a*+V=  1,  say  a  cyclic  scalar.    Thus,  if  we  set 

we  shall  have 

M'_ |y'  =  (a—  16)  Di  —  tv], 

and 
That  is, 

fl./JL  +V.V  =s/*./*+V.|', 

and  so  in  the  other  cases. 

4.  Def. — In  biscalar  analysis,  the  product  of  a  biscalar  and?  its  conju- 
gate is  a  positive  scalar.  The  positive  square  root  of  this  scalar  is 
called  the  modulus  of  the  biscalar.  In  bivector  analysis,  the  direct 
product  of  a  bivector  and  its  conjugate  is,  as  seen  above,  a  positive 
scalar.  The  positive  square  root  of  this  scalar  may  be  called  the 
modulus  of  the  bivector.  When  this  modulus  vanishes,  the  bivector 
vanishes,  and  only  in  this  case.  If  the  bivector  is  multiplied  by  a 
biscalar,  its  modulus  is  multiplied  by  the  modulus  of  the  biscalar. 
The  conjugate  of  a  (real)  vector  is  the  vector  itself,  and  the  modulus 
of  the  vector  is  the  same  as  its  magnitude. 

5.  Def. — If  between  two  vectors,  a  and  ft,  there  subsists  a  relation 
of  the  form 

a— nft, 
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where  n  is  a  scalar,  we  say  that  the  vectors  are  parallel.  Analogy 
leads  ns  to  call  two  biveccors  parallel,  when  there  subsists  between 
them  a  relation  of  the  form 

a=mb, 

where  m  (in  the  most  general  case)  is  a  biscalar. 

To  aid  ns  in  comprehending  the  geometrical  signification  of  this 
relation,  we  may  regard  the  biscalar  as  consisting  of  two  factors,  one 
of  which  is  a  positive  scalar  (the  modulus  of  the  biscalar),  and  the 
other  may  be  put  in  the  form  cos  g+i  sing.  The  effect  of  multiplying 
a  bivector  by  a  positive  scalar  is  obvious.  To  understand  the  effect  of 
a  multiplier  of  the  form  cos q +« sing  upon  a  bivector  /*H-«y,  let  us  set 

M  +«/— (cob  q+i  sin  q)[p+iv]. 
We  have  then 

/i'«coeg/i  —  sing?, 

v'*=coBqv  +  sing/x. 

Now  if  fi  and  v  are  of  the  same  magnitude  and  at  right  angles,  the 
effect  of  the  multiplication  is  evidently  to  rotate  these  vectors  in 
their  plane  an  angular  distance  g,  which  is  to  be  measured  in  the 
direction  from  v  to  p.  In  any  case  we  may  regard  fi  and  v  as  the  pro- 
jections  (by  parallel  lines)  of  two  perpendicular  vectors  of  the  same 
length.  The  two  last  equations  show  that  n'  and  v  will  be  the 
projections  of  the  vectors  obtained  by  the  rotation  of  these  perpendi- 
cular vectors  in  their  plane  through  the  angle  q.  Hence,  if  we 
construct  an  ellipse  of  which  fi  and  v  are  conjugate  semi-diameters,  /*' 
and  v  will  be  another  pair  of  conjugate  semi-diameters,  and  the  sectors 
between  fi  and  fi'  and  between  v  and  v\  will  each  be  to  the  whole  area 
of  the  ellipse  as  q  to  2x,  the  sector  between  v  and  v  lying  on  the  same 
side  of  v  and  fit  and  that  between  fi  and  p'  lying  on  the  same  side 
of  fi  as  —v. 

It  follows  that  any  bivector  p+iv  may  be  put  in  the  form 

(cos  q + 1  sin  q)  [a  +  iffl, 

in  which  a  and  /3  are  at  right  angles,  being  the  semi-axes  of  the  ellipse 
of  which  fi  and  v  are  conjugate  semi-diameters.  This  ellipse  we  may 
call  the  directional  ellipse  of  the  bivector.  In  the  case  of  a  real 
vector,  or  of  a  vector  having  a  real  direction,  it  reduces  to  a  straight 
line.  In  any  other  case,  the  angular  direction  from  the  imaginary  to 
the  real  part  of  the  bivector  is  to  be  regarded  as  positive  in  the  ellipse, 
and  the  specification  of  the  ellipse  must  be  considered  incomplete 
without  the  indication  of  this  direction. 

Parallelism  of  bivectors,  then,  signifies  the  similarity  and  similar 
position  of  their  directional  ellipses.  Similar  position  includes  iden- 
tity of  the  angular  directions  mentioned  above. 
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6.  To  reduce  a  given  bivector  x  to  the  above  form,  we  may  set 

t.t=(coe  q+i  sin  ?)2[a+ij8].[a+ij8] 
=(cos  2?  +  <  sin  2?)  (a.a-)8.i8) 
=  a+ib, 

where  a  and  6  are  scalars,  which  we  may  regard  as  known.  The 
value  of  q  may  be  determined  by  the  equation 

tan2<7  =  -, 

the  quadrant  to  which  2g  belongs  being  determined  so  as  to  give 
sin  2q  and  cos  2q  the  same  signs  as  b  and  a.  Then  a  and  jS  will  be 
given  by  the  equation 

a+4£=s(co8£  —  £sing)r. 

The  solution  is  indeterminate  when  the  real  and  imaginary  parts  of 
the  given  bivector  are  perpendicular  and  equal  in  magnitude.  In  this 
case  the  directional  ellipse  is  a  circle,  and  the  bivector  may  be  called 
circular.    The  criterion  of  a  circular  bivector  is 

r.r=0. 

It  is  especially  to  be  noticed  that  from  this  equation  we  cannot 
conclude  that 

r=0, 

as  in  the  analysis  of  real  vectors.  This  may  also  be  shown  by  ex- 
pressing r  in  the  form  xi+yj+zkt  in  which  x,  y,  z  are  biscalars.  The 
equation  then  becomes 

a2+y2+s2=0, 

which  evidently  does  not  require  xt  y,  and  z  to  vanish,  as  would  be 
the  case  if  only  real  values  are  considered. 

7.  Def. — We  call  two  vectors  p  and  <r  perpendicular  when  /».<r  =  0. 
Following  the  same  analogy,  we  shall  call  two  bivectors  t  and  6 
perpendicular,  when 

r.«=0. 

In  considering  the  geometrical  signification  of  this  equation,  we  shall 
first  suppose  that  the  real  and  imaginary  components  of  r  and  3  lie  in 
the  same  plane,  and  that  both  r  and  3  have  not  real  directions.  It  is 
then  evidently  possible  to  express  them  in  the  form 

where  m  and  m  are  biscalar,  a  and  /3  are  at  right  angles,  and  a' 
parallel  with  £.     Then  the  equation  t.S=0  requires  that 

,8.^  =  0,     and     a.jS'+jS.a'-O. 

This  shows  that  the  directional  ellipses  of  the  two  bivectors  are 
similar  and  the  angular  direction  from  the  real  to  the  imaginary 
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component  is  the  same  in  both,  but  the  major  axes  of  the  ellipses  are 
perpendicular.  The  case  in  which  the  directions  of  r  and  6  are  real, 
forms  no  exception  to  this  rule. 

It  will  be  observed  that  every  circular  bivector  is  perpendicular  to 
itself,  and  to  every  parallel  bivector. 

If  two  bivectors,  /jl+iv,  h'+iv,  which  do  not  lie  in  the  same  plane 
are  perpendicular,  we  may  resolve  fi  and  v  into  components  parallel 
and  perpendicular  to  the  plane  of  /*'  and  v.  The  components  perpen- 
dicular to  the  plane  evidently  contribute  nothing  to  the  value  of 

Therefore  the  components  of  fi  and  v  parallel  to  the  plane  of  /*',  v, 
form  a  bivector  which  is  perpendicular  to  fi'+w'.  That  is,  if  two 
bivectors  are  perpendicular,  the  directional  ellipse  of  either,  projected 
upon  the  plane  of  the  other  and  rotated  through  a  quadrant  in  that 
plane,  will  be  similar  and  similarly  situated  to  the  directional  ellipse 
of  the  second. 

8.  A  bivector  may  be  divided  in  one  and  only  one  way  into  parts 
parallel  and  perpendicular  to  another,  provided  that  the  second  is  not 
circular.    If  a  and  6  are  the  bivectors,  the  parts  of  a  will  be 

b.at        ,         b.at 
06    and    0-ob. 

If  6  is  circular,  the  resolution  of  a  is  impossible,  unless  it  is  perpen- 
dicular to  6.    In  this  case  the  resolution  is  indeterminate. 

9.  Since  axb.a«0,  and  axb.b—0,  axb  is  perpendicular  to  a  and  b. 
We  may  regard  the  plane  of  the  product  as  determined  by  the 
condition  that  the  directional  ellipses  of  the  factors  projected  upon 
it  become  similar  and  similarly  situated.  The  directional  ellipse  of 
the  product  is  similar  to  these  projections,  but  its  orientation  is 
different  by  90°.  It  may  easily  be  shown  that  axb  vanishes  only 
with  a  or  b,  or  when  a  and  b  are  parallel. 

10.  The  bivector  equation 

(axf.c)b-(b.cxb)a+(c.bxa)b-(b.axb)c=0 

is  identical,  as  may  be  verified  by  substituting  expressions  of  the 
form  wi+yj+zk  (x,  y,  z  being  biscalars),  for  each  of  the  bivectors. 
(Compare  No.  87.)  This  equation  shows  that  if  the  product  axb 
of  any  two  bivectors  vanishes,  one  of  these  will  be  equal  to  the  other 
with  a  biscalar  coefficient,  that  is,  they  will  be  parallel,  according 
to  the  definition  given  above.  If  the  product  a.bxc  of  any  three 
bivectors  vanishes,  the  equation  shows  that  one  of  these  may  be 
expressed  as  a  sum  of  the  other  two  with  biscalar  coefficients.  In 
this  case,  we  may  say  (from  the  analogy  of  the  scalar  analysis)  that 
the  three  bivectors  are  complanar.    (This  does  not  imply  that  they 
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lie  in  any  same  real  plane.)     If  a.bxc  is  not  equal  to  zero,  the 
equation  shows  that  any  fourth  bivector  may  be  expressed  as  a  sum 
of  a,  6,  and  c  with  biscalar  coefficients,  and  indicates  how  these 
coefficients  may  be  determined. 
11.  The  equation 

(r.a)6xe+(T.b)cxa+(r.c)axb~(axb.c)t 

is  also  identical,  as  may  easily  be  verified.    If  we  set 

c«=axb, 

and  suppose  that  ~  *     ~ 

rr^  r.a=0,      r.b=0, 

the  equation  becomes 

(t.axb)axb*(axb.axb)t. 

This  shows  that  if  a  bivector  t  is  perpendicular  to  two  bivectors 
a  and  b,  which  are  not  parallel,  t  will  be  parallel  to  axb.  Therefore 
all  bivectors  which  are  perpendicular  to  two  given  bivectors  are 
parallel  to  each  other,  unless  the  given  two  are  parallel. 


[NoU  by  Editor*.— The  notation  |+ 1  *0->,  used  on  page  64,  was  later  improved  by  the 
author  by  the  introduction  of  his  Double  Multiplication,  according  to  which  the  above 
expression  is  represented  by  *s,  and  |*|  by  *,.  See  this  volume,  pages  112,  ISO, 
and  181.  For  an  extended  treatment  of  Professor  Qibbs's  researches  on  Double  Multi- 
plication in  their  application  to  Vector  Analysis  see  pp.  906-321,  and  333  of  **  Veotor 
Analysis,"  by  E.  B.  Wilson,  Chas.  Soribner's  Sons,  New  York,  1901.] 
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ON  MULTIPLE  ALGEBRA. 

address  before  the  8ection  of  mathematics  and  astbokomy  of  the  american 
Association  for  the  Advancement  of  Science,  bt  the  Vice-President. 

[Proceedings  of  the  American  Association  for  the  Advancement  of  Science, 
vol  xxxv.  pp.  37-66,  1886.] 

It  has  been  said  that  "  the  human  mind  has  never  invented  a  labor- 
saving  machine  equal  to  algebra."*  If  this  be  true,  it  is  but  natural 
and  proper  that  an  age  like  our  own,  characterized  by  the  multi- 
plication of  labor-saving  machinery,  should  be  distinguished  by  an 
unexampled  development  of  this  most  refined  and  most  beautiful  of 
machines.  That  such  has  been  the  case,  none  will  question.  The 
improvement  has  been  in  every  part.  Even  to  enumerate  the  principal 
lines  of  advance  would  be  a  task  for  any  one;  for  me  an  impossibility. 
But  if  we  should  ask,  in  what  direction  the  advance  has  been  made 
which  is  to  characterize  the  development  of  algebra  in  our  day,  we 
may,  I  think,  point  to  that  broadening  of  its  field  and  methods  which 
gives  us  multiple  algebra. 

Of  the  importance  of  this  change  in  the  conception  of  the  office  of 
algebra,  it  is  hardly  necessary  to  speak :  that  it  is  really  characteristic 
of  our  time  will  be  most  evident  if  we  go  back  some  two  or  three- 
score years,  to  the  time  when  the  seeds  were  sown  which  are  now 
yielding  so  abundant  a  harvest.  The  failure  of  Mabius,  Hamilton, 
Grassmann,  Saint- Venant  to  make  an  immediate  impression  upon 
the  course  of  mathematical  thought  in  any  way  commensurate  with 
the  importance  of  their  discoveries  is  the  most  conspicuous  evidence 
that  the  times  were  not  ripe  for  the  methods  which  they  sought  to 
introduce.  A  satisfactory  theory  of  the  imaginary  quantities  of 
ordinary  algebra,  which  is  essentially  a  simple  case  of  multiple 
algebra,  with  difficulty  obtained  recognition  in  the  first  third  of  this 
century.  We  must  observe  that  this  double  algebra,  as  it  has  been 
called,  was  not  sought  for  or  invented; — it  forced  itself,  unbidden, 
upon  the  attention  of  mathematicians,  and  with  its  rules  already 
formed. 


*  The  Nation,  voL  xxxiii,  p.  287. 
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But  the  idea  of  doable  algebra,  once  received,  although  as  it  were 
unwillingly,  must  have  suggested  to  many  minds  more  or  less 
distinctly  the  possibility  of  other  multiple  algebras,  of  higher  orders, 
possessing  interesting  or  useful  properties. 

The  application  of  double  algebra  to  the  geometry  of  the  plane 
suggested  not  unnaturally  to  Hamilton  the  idea  of  a  triple  algebra 
which  should  be  capable  of  a  similar  application  to  the  geometry  of 
three  dimensions.  He  was  unable  to  find  a  satisfactory  triple  algebra, 
but  discovered  at  length  a  quadruple  algebra,  quaternions,  which 
answered  his  purpose,  thus  satisfying,  as  he  says  in  one  of  his  letters, 
an  intellectual  want  which  had  haunted  him  at  least  fifteen  years. 
So  confident  was  he  of  the  value  of  this  algebra,  that  the  same  hour 
he  obtained  permission  to  lay  his  discovery  before  the  Royal  Irish 
Academy,  which  he  did  on  November  13,  1843.*  This  system  of 
multiple  algebra  is  far  better  known  than  any  other,  except  the 
ordinary  double  algebra  of  imaginary  quantities, — far  too  well  known 
to  require  any  especial  notice  at  my  hands.  All  that  here  requires 
our  attention  is  the  close  historical  connection  between  the  imaginaries 
of  ordinary  algebra  and  Hamilton's  system,  a  fact  emphasised  by 
Hamilton  himself  and  most  writers  on  quaternions.  It  was  quite 
otherwise  with  Mobius  and  Grassmann. 

The  point  of  departure  of  the  Ba^centrischer  Calcul  of  Mttbius, 
published  in  1827, — a  work  of  which  Clebsch  has  said  that  it  can 
never  be  admired  enough,t — is  the  use  of  equations  in  which  the 
terms  consist  of  letters  representing  points  with  numerical  coefficients, 
to  express  barycentric  relations  between  the  points.  Thus,  that  the 
point  8  is  the  centre  of  gravity  of  weights,  a,  6,  c,  d,  placed  at  the 
points  A,  B,  (7,  D,  respectively,  is  expressed  by  the  equation 

(a+b+c+d)S=aA+bB+cC+dD. 

An  equation  of  the  more  general  form 

aA  +6£+cC+etc.  =pP+gQ+ri*+ etc. 

signifies  that  the  weights  a,  b,  c,  etc.,  at  the  points  A,  B,  C,  etc.,  have 
the  same  sum  and  the  same  centre  of  gravity  as  the  weights  p,  qf  r, 
etc.,  at  the  points  P,  Q,  R,  etc,  or,  in  other  words,  that  the  former  are 
barycentrically  equivalent  to  the  latter.  Such  equations,  of  which 
each  represents  four  ordinary  equations,  may  evidently  be  multiplied 
or  divided  by  scalars,i  may  be  added  or  subtracted,  and  may  have 


*  PhU.  Mag.  (3),  vol  xxv,  p.  490 ;  North  British  Review,  vol  xlv  (1866),  p.  57. 

fSee  his  eulogy  on  Pltioker,  p.  14,  OdU.  Abhandl.,  vol.  xvi. 

X I  use  this  terra  in  Hamilton's  sense,  to  denote  the  ordinary  positive  and  negative 
quantities  of  algebra.  It  may,  however,  be  observed  that  in  most  oases  in  whioh  I  shall 
have  occasion  to  use  it,  the  proposition  would  hold  without  exclusion  of  imaginary 
quantities,— that  this  exoluaion  is  generally  for  simplicity  and  not  from  necessity. 
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their  terms  arranged  and  transposed,  exactly  like  the  ordinary 
equations  of  algebra.  It  follows  that  the  elimination  of  letters 
representing  points  from  equations  of  this  kind  is  performed  by  the 
rules  of  ordinary  algebra.  This  is  evidently  the  beginning  of  a 
quadruple  algebra,  and  is  identical,  as  far  as  it  goes,  with  Grassmann's 
marvellous  geometrical  algebra. 

In  the  same  work  we  find  also,  for  the  first  time  so  far  as  I  am 
aware,  the  distinction  of  positive  and  negative  consistently  carried 
out  on  the  designation  of  segments  of  lines,  of  triangles,  and  of  tetra- 
hedra,  viz.,  that  a  change  in  place  of  two  letters,  in  such  expressions 
as  AB,  ABC,  ABCD,  is  equivalent  to  prefixing  the  negative  sign.  It 
is  impossible  to  overestimate  the  importance  of  this  step,  which  gives 
to  designations  of  this  kind  the  generality  and  precision  of  algebra. 

Moreover,  if  A,  B,  0  are  three  points  in  the  same  straight  line,  and 
D  any  point  outside  of  that  line,  the  author  observes  that  we  have 

AB+BC+CA  =  0, 

and  also,  with  D  prefixed, 

DAB+DBC+DCA=0. 

Again,  if  A9  B,  0,  D  are  four  points  in  the  same  plane,  and  E  any 
point  outside  of  that  plane,  we  have 

ABC-BCD+CDA-DAB=0, 

and  also,  with  E  prefixed, 

EABC-EBCD+ECDA-EDAB~0. 

The  similarity  to  multiplication  in  the  derivation  of  these  formulae 
cannot  have  escaped  the  author's  notice.  Yet  he  does  not  seem  to 
have  been  able  to  generalize  these  processes.  It  was  reserved  for  the 
genius  of  Grassmann  to  see  that  AB  might  be  regarded  as  the 
product  of  A  and  B9  DAB  as  the  product  of  D  and  AB,  and  EABC 
as  the  product  of  E  and  jlBO.  That  Mttbius  could  not  make  this  step 
was  evidently  due  to  the  fact  that  he  had  not  the  conception  of  the 
addition  of  other  multiple  quantities  than  such  as  may  be  represented 
by  masses  situated  at  points.  Even  the  addition  of  vectors  (i.e.,  the 
fact  that  the  composition  of  directed  lines  could  be  treated  as  an 
addition)  seems  to  have  been  unknown  to  him  at  this  time,  although 
he  subsequently  discovered  it,  and  used  it  in  his  Mechxmik  dee 
HimmeU,  which  was  published  in  1843.  This  addition  of  vectors, 
or  geometrical  addition,  seems  to  have  occurred  independently  to 
many  persons. 

Seventeen  years  after  the  Barycentrischer  Calcvl,  in  1844,  the  year 
in  which  Hamilton's  first  papers  on  quaternions  appeared  in  print, 
Grassmann  published  his  Lineale  AuadehnungeUhre,  in  which  he 
developed  the  idea  and  the  properties  of  the  external  or  combinatorial 
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product,  a  conception  which  is  perhaps  to  be  regarded  as  the  greatest 
monument  of  the  author's  genius.  This  volume  was  to  have  been 
followed  by  another,  of  the  nature  of  which  some  intimation  was 
given  in  the  preface  and  in  the  work  itself.  We  are  especially  told 
that  the  internal  product*  which  for  vectors  is  identical  except  in 
sign  with  the  scalar  part  of  Hamilton's  product  (just  as  Grassmann's 
external  product  of  two  vectors  is  practically  identical  with  the 
vector  part  of  Hamilton's  product),  and  the  open  product,!  which  in 
the  language  of  to-day  would  be  called  a  matrix,  were  to  be  treated 
in  the  second  volume.  But  both  the  internal  product  of  vectors  and 
the  open  product  are  clearly  defined,  and  their  fundamental  properties 
indicated,  in  this  first  volume. 

This  remarkable  work  remained  unnoticed  for  more  than  twenty 
years,  a  fact  which  was  doubtless  due  in  part  to  the  very  abstract  and 
philosophical  manner  in  which  the  subject  was  presented.  In  con- 
sequence of  this  neglect  the  author  changed  his  plan,  and  instead  of  a 
supplementary  volume  published  in  1862  a  single  volume  entitled 
AusdehnvmgsUhre,  in  which  were  treated,  in  an  entirely  different 
style,  the  same  topics  as  in  the  first  volume,  as  well  as  those  which  he 
had  reserved  for  the  second. 

Deferring  for  the  moment  the  discussion  of  these  topics  in  order  to 
follow  the  course  of  events,  we  find  in  the  year  following  the  first 
Auedehnungdehre  a  remarkable  memoir  of  Saint- Venant  J,  in  which 
are  clearly  described  the  addition  both  of  vectors  and  of  oriented 
areas,  the  differentiation  of  these  with  respect  to  a  scalar  quantity, 
and  a  multiplication  of  two  vectors  and  of  a  vector  and  an  oriented 
area.  These  multiplications,  called  by  the  author  geometrical,  are 
entirely  identical  with  Orassmann's  external  multiplication  of  the 
same  quantities. 

It  is  a  striking  fact  in  the  history  of  the  subject,  that  the  short 
period  of  less  than  two  years  was  marked  by  the  appearance  of 
well-developed  and  valuable  systems  of  multiple  algebra  by  British, 
German,  and  French  authors,  working  apparently  entirely  inde- 
pendently of  one  another.  No  system  of  multiple  algebra  had 
appeared  before,  so  far  as  I  know,  except  such  as  were  confined  to 
additive  processes  with  multiplication  by  scalars,  or  related  to  the 
ordinary  double  algebra  of  imaginary  quantities.  But  the  appearance 
of  a  single  one  of  these  systems  would  have  been  sufficient  to  mark 
an  epoch,  perhaps  the  most  important  epoch  in  the  history  of  the 
subject 

In  1853  and  1854,  Cauchy  published  several  memoirs  on  what 
he  called  clef 8  cdg4briques.§    These  were  units  subject  generally  to 

•  See  the  prefroe.  tSee|172. 

X  Oomptts  Rendu*,  vol.  xzi,  p.  620.  |  CampUt  Rendu*,  voii.  xxxvi,  ff. 
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combinatorial  multiplication.  His  principal  application  was  to  the 
theory  of  elimination.  In  this  application,  as  in  the  law  of  multi- 
plication, he  had  been  anticipated  by  Grassmann. 

We  come  next  to  Cayley's  celebrated  Memoir  on  the  Theory  of 
Matrices  *  in  1858,  of  which  Sylvester  has  said  that  it  seems  to  him 
to  have  ushered  in  the  reign,  of  Algebra  the  Second,  t  I  quote  this 
dictum  of  a  master  as  showing  his  opinion  of  the  importance  of  the 
subject  and  of  the  "memoir.  But  the  foundations  of  the  theory  of 
matrices,  regarded  as  multiple  quantities,  seem  to  me  to  have  been 
already  laid  in  the  Ausdehnungslehre  of  1844.  To  Qrassmann's 
treatment  of  this  subject  we  shall  recur  later. 

After  the  Ausdehnungslehre  of  1862,  already  mentioned,  we  come 
to  Hankers  Vorlesungen  iiber  die  complexen  Zahlen,  1867.  Under 
this  title  the  author  treats  of  the  imaginary  quantities  of  ordinary 
algebra,  of  what  he  calls  aUemirende  Zahlen,  and  of  quaternions. 
These  alternate  numbers,  like  Cauchy's  clefs,  are  quantities  subject 
to  Qrassmann's  law  of  combinatorial  multiplication.  This  treatise, 
published  twenty-three  years  after  the  first  Ausdehnungslehre,  marks 
the  first  impression  which  we  can  discover  of  Grassmann's  ideas  upon 
the  course  of  mathematical  thought  The  transcendent  importance 
of  these  ideas  was  fully  appreciated  by  the  author,  whose  very  able 
work  seems  to  have  had  considerable  influence  in  calling  the  attention 
of  mathematicians  to  the  subject. 

In  1870,  Professor  Benjamin  Peirce  published  his  Linear  Associative 
Algebra,  subsequently  developed  and  enriched  by  his  son,  Professor 
C.  S.  Peirce.  The  fact  that  the  edition  was  lithographed  seems  to 
indicate  that  even  at  this  late  date  a  work  of  this  kind  could  only 
be  regarded  as  addressed  to  a  limited  number  of  readers.  But  the 
increasing  interest  in  such  subjects  is  shown  by  the  republication  of 
this  memoir  in  1881,  {  as  by  that  of  the  first  Ausdehnungslehre  in 
1878. 

The  article  on  quaternions  which  has  just  appeared  in  the  Encyclo- 
pasdia  Britannica  mentions  twelve  treatises,  including  second  editions 
and  translations,  besides  the  original  treatises  of  Hamilton.  That  all 
the  twelve  are  later  than  1861  and  all  but  two  later  than  1872  shows 
the  rapid  increase  of  interest  in  this  subject  in  the  last  years. 

Finally,  we  arrive  at  the  Lectures  on  the  Principles  of  Universal 
Algebra  by  the  distinguished  foreigner  whose  sojourn  among  us  has 
given  such  an  impulse  to  mathematical  study  in  this  country.  The 
publication  of  these  lectures,  commenced  in  1884  in  the  American 
Journal  of  Mathematics,  has  not  as  yet  been  completed, — a  want  but 
imperfectly  supplied  by  the  author's  somewhat  desultory  publication 

*Phil.  Trans.,  voL  oxlviii.  i  Amer.  Journ.  Math.,  voL  vi,  p.  271. 

ZAtner.  Journ.  Math,,  vol.  iv. 
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of  many  remarkable  papers  on  the  same  subject  (which  might  be 
more  definitely  expressed  as  the  algebra  of  matrices)  in  various 
foreign  journals. 

It  is  not  an  accident  that  this  century  has  seen  the  rise  of  multiple 
algebra.  The  course  of  the  development  of  ideas  in  algebra  and  in 
geometry,  although  in  the  main  independent  of  any  aid  from  this 
source,  has  nevertheless  to  a  very  large  extent  been  of  a  character 
which  can  only  find  its  natural  expression  in  multiple  algebra. 

Our  Modern  Higher  Algebra  is  especially  occupied  with  the  theory 
of  linear  transformations.  Now  what  are  the  first  notions  which  we 
meet  in  this  theory  ?  We  have  a  set  of  n  variables,  say  xt  y,  z>  and 
another  set,  say  *',  y\  z\  which  are  homogeneous  linear  functions  of 
the  first,  and  therefore  expressible  in  terms  of  them  by  means  of  a 
block  of  ns  coefficients.  Here  the  quantities  occur  by  sets,  and  invite 
the  notations  of  multiple  algebra.  It  was  in  fact  shown  by  Grass- 
mann  in  his  first  Ausddinwnydekre  and  by  Cauchy  nine  years  later, 
that  the  notations  of  multiple  algebra  afford  a  natural  key  to  the 
subject  of  elimination. 

Now  I  do  not  merely  mean  that  we  may  save  a  little  time  or  space 
by  writing  perhaps  p  for  x,  y  and  z ;  p  for  x\  y'  and  z' ;  and  $  for  a 
block  of  n*  quantities.  But  I  mean  that  the  subject  as  usually  treated 
under  the  title  of  determinants  has  a  stunted  and  misdirected  develop- 
ment on  account  of  the  limitations  of  single  algebra.  This  will  appear 
from  a  very  simple  illustration.  After  a  little  preliminary  matter 
the  student  comes  generally  to  a  chapter  entitled  *  Multiplication  of 
Determinants,"  in  which  he  is  taught  that  the  product  of  the  deter- 
minants of  two  matrices  may  be  found  by  performing  a  somewhat 
lengthy  operation  on  the  two  matrices,  by  which  he  obtains  a  third 
matrix,  and  then  taking  the  determinant  of  this.  But  what  signifi- 
cance, what  value  has  this  theorem  ?  For  aught  that  appears  in  the 
majority  of  treatises  which  I  have  seen,  we  have  only  a  complicated 
and  lengthy  way  of  performing  a  simple  operation.  The  real  facts 
of  the  case  may  be  stated  as  follows : 

Suppose  the  set  of  n  quantities  p  to  be  derived  from  the  set  p  by 
the  matrix  $,  which  we  may  express  by 

P=$p; 
and  suppose  the  set  p"  to  be  derived  from  the  set  p  by  the  matrix  "¥",  i.e., 

and  /o"  =  ¥.#./>; 

it  is  evident  that  p"  can  be  derived  from  p  by  the  operation  of  a 

single  matrix,  say  9.  i.e., 

so  that  9  =  ¥.4>. 
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In  the  language  of  multiple  algebra  6  is  called  the  product  of 
¥  and  ♦.  It  is  of  course  interesting  to  see  bow  it  is  derived  from 
tbe  latter,  and  it  is  little  more  than  a  schoolboy's  exercise  to  determine 
this.  Now  this  matrix  6  baa  the  property  that  its  determinant 
is  equal  to  tbe  products  of  tbe  determinants  of  ¥  and  ♦.  And  this 
property  is  all  that  is  generally  stated  in  the  books,  and  tbe  funda- 
mental property,  which  is  all  that  gives  the  subject  its  interest,  that  0 
is  itself  the  product  of  ¥  and  $  in  the  language  of  multiple  algebra, 
i.e.,  that  operating  by  6  is  equivalent  to  operating  successively  by 
$  and  ¥,  is  generally  omitted.  The  chapter  on  this  subject,  in  most 
treatises  which  I  have  seen,  reads  very  like  the  play  of  Hamlet  with 
Hamlet's  part  left  out 

And  what  is  the  cause  of  this  omission  ?  Certainly  not  ignorance 
of  the  property  in  question.  The  fact  that  it  is  occasionally 
given  would  be  a  sufficient  bar  to  this  answer.  It  is  because 
the  author  fails  to  see  that  his  real  subject  is  matrices  and  not 
determinants.  Of  course,  in  a  certain  sense,  the  author  has  a  right 
to  choose  his  subject  But  this  does  not  mean  that  the  choice 
is  unimportant,  or  that  it  should  be  determined  by  chance  or  by 
caprice.  The  problem  well  put  is  half  solved,  as  we  all  know. 
If  one  chooses  the  subject  ill,  it  will  develop  itself  in  a  cramped 
manner. 

But  the  case  is  really  much  worse  than  I  have  stated  it  Not  only 
is  the  true  significance  of  the  formation  of  6  from  ¥  and  $  not  given, 
but  the  student  is  often  not  taught  to  form  the  matrix  which  is  the 
product  of  ¥  and  $,  but  one  which  is  the  product  of  one  of  these 
matrices  and  the  conjugate  of  the  other.  Thus  the  proposition  which 
is  proved  loses  all  its  simplicity  and  significance,  and  must  be  recast 
before  tbe  instructor  can  explain  its  true  bearings  to  the  student 
This  fault  has  been  denounced  by  Sylvester,  and  if  anyone  thinks 
I  make  too  much  of  the  standpoint  from  which  the  subject  is  viewed, 
I  will  refer  him  to  the  opening  paragraphs  of  the  "  Lectures  on 
Universal  Algebra"  in  the  sixth  volume  of  the  American  Journal  of 
Mathematics,  where,  with  a  wealth  of  illustration  and  an  energy 
of  diction  which  I  cannot  emulate,  the  most  eloquent  of  mathe- 
maticians expresses  his  sense  of  the  importance  of  the  substitution 
of  the  idea  of  the  matrix  for  that  of  the  determinant  If  then  so 
important,  why  was  the  idea  of  the  matrix  let  slip  ?  Of  course  the 
writers  on  this  subject  had  it  to  commence  with.  One  cannot  even 
define  a  determinant  without  the  idea  of  a  matrix.  The  simple  fact  is 
that  in  general  the  writers  on  this  subject  have  especially  developed 
those  ideas  which  are  naturally  expressed  in  simple  algebra,  and  have 
postponed  or  slurred  over  or  omitted  altogether  those  ideas  which 
find  their  natural  expression  in  multiple  algebra.    But  in  this  subject 
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the  latter  happen  to  be  the  fundamental  ideas,  and  those  which  ought 
to  direct  the  whole  course  of  thought. 

I  have  taken  a  very  simple  illustration,  perhaps  the  very  first 
theorem  which  meets  the  student  after  those  immediately  connected 
with  the  introductory  definitions,  both  because  the  simplest  illustration 
is  really  the  best,  and  because  I  am  here  most  at  home.  But  the 
principles  of  multiple  algebra  seem  to  me  to  shed  a  flood  of  light  into 
every  corner  of  the  subjects  usually  treated  under  the  title  of  deter- 
minants, the  subject  gaining  as  much  in  breadth  from  the  new  notions 
as  in  simplicity  from  the  new  notations;  and  in  the  more  intricate 
subjects  of  invariants,  covariants,  etc,  I  believe  that  the  principles  of 
multiple  algebra  are  ready  to  perform  an  equal  service.  Certainly 
they  make  many  things,  seem  very  simple  to  me  which  I  should 
otherwise  find  difficult  of  comprehension. 

Let  us  turn  to  geometry. 

If  we  were  asked  to  characterize  in  a  single  word  our  modern 
geometry,  we  would  perhaps  say  that  it  is  a  geometry  of  position. 
Now  position  is  essentially  a  multiple  quantity,  or,  if  you  prefer,  is 
naturally  represented  in  algebra  by  a  multiple  quantity.  And  the 
growth  in  this  century  of  the  so-called  synthetic  as  opposed  to 
analytical  geometry  seems  due  to  the  fact  that  by  the  ordinary 
analysis  geometers  could  not  easily  express,  except  in  a  cumbersome 
and  unnatural  manner,  the  sort  of  relations  in  which  they  were  par- 
ticularly interested.  With  the  introduction  of  the  notations  of  multiple 
algebra,  this  difficulty  falls  away,  and  with  it  the  opposition  between 
synthetic  and  analytical  geometry. 

It  is,  however,  interesting  and  very  instructive  to  observe  how  the 
ingenuity  of  mathematicians  has  often  triumphed  over  the  limitations 
of  ordinary  algebra.  A  conspicuous  example  and  one  of  the  simplest 
is  seen  in  the  M&anique  Analytique,  where  the  author,  by  the  use 
of  what  are  sometimes  called  indeterminate  equations,  is  able  to  write 
in  one  equation  the  equivalent  of  an  indefinite  number.  Thus  the 
equation 

X  dx  +  Ydy + Zdz = 0, 

by  the  indeterminateness  of  the  values  of  dx,  dy,  dz,  is  made  equiva- 
lent to  the  three  equations 

X=0,     F=0,    Z=0. 
It  is  instructive  to  compare  this  with 

Xi+Yf+Zk~09 
which  is  the  form  that  Hamilton  or  Grassmann  would  have  used. 
The  use  of  this  analytical  artifice,  if  such  it  can  be  called,  runs  all 
through  the  work  and  is  fairly  characteristic  of  it. 

Again,  the  introduction  of  the  potential  in  the  theory  of  gravity,  or 
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electricity,  or  magnetism,  gives  us  a  scalar  quantity  instead  of  a  vector, 
as  the  subject  of  study;  and  in  mechanics  generally  the  use  of  the 
force-function  substitutes  a  simple  quantity  for  a  complex.  This 
method  is  in  reality  not  different  from  that  just  mentioned,  since 
Lagrange's  indeterminate  equation  expresses,  at  least  in  its  origin, 
the  variation  of  the  force-function.  It  is  indeed  the  real  beauty  of 
Lagrange's  method  that  it  is  not  so  much  an  analytical  artifice,  as  the 
natural  development  of  the  subject 

In  modern  analytical  geometry  we  find  methods  in  use  which  are 
exceedingly  ingenious,  and  give  forms  curiously  like  those  of  multiple 
algebra,  but  which,  at  least  if  logically  carried  out  very  far,  are  exces- 
sively artificial,  and  that  for  the  expression  of  the  simplest  things. 
The  simplest  conceptions  of  the  geometry  of  three  dimensions  are 
points  and  planes,  and  the  simplest  relation  between  these  is  that  a 
point  lies  in  a  plane.  Let  us  see  how  these  notions  have  been  handled 
by  means  of  ordinary  algebra,  and  by  multiple  algebra.  It  will  illus- 
trate the  characteristic  difference  of  the  methods,  perhaps  as  well  as 
the  reading  of  an  elaborate  treatise. 

In  multiple  algebra  a  point  is  designated  by  a  single  letter,  just  as 
it  is  in  what  is  called  synthetic  geometry,  and  as  it  generally  is  by  the 
ordinary  analyst  when  he  is  not  writing  equations.  But  in  his 
equations,  instead  of  a  single  letter  the  analyst  introduces  several 
letters  (coordinates)  to  represent  the  point 

A  plane  may  be  represented  in  multiple  algebra  as  in  synthetic 
geometry  by  a  single  letter;  in  the  ordinary  algebra  it  is  sometimes 
represented  by  three  coordinates,  for  which  it  is  most  convenient  to 
take  the  reciprocals  of  the  segments  cut  off  by  the  plane  on  three  axes. 
But  the  modern  analyst  has  a  more  ingenious  method  of  representing 
the  plane.    He  observes  that  the  equation  of  the  plane  may  be  written 

fr+W+f«-l,  (1) 

where  £  9,  £  are  the  reciprocals  of  the  segments,  and  x,  y,  z  are  the 
coordinates  of  any  point  in  the  plane.    Now  if  we  set 

l>=£a;+*y+{*,  (2) 

this  letter  will  represent  an  expression  which  represents  the  plane.  In 
fact,  we  may  say  that  p  implicitly  contains  £  9,  and  f,  which  are  the 
coordinates  of  the  plane.  We  may  therefore  speak  of  the  plane  p,  and 
for  many  purposes  can  introduce  the  letter  p  into  our  equations  instead 
of  £  9,  £    For  example,  the  equation 

rji^ir  (3) 

is  equivalent  to  the  three  equations 

r-£±£     r-2^'.    r'^f-         <*> 
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.  It  is  to  be  noticed  that  on  account  of  the  indeterminatenesB  of  the 
x,  y,  and  z,  this  method,  regarded  as  an  analytical  artifice,  is  identical 
with  that  of  Lagrange,  also  that  in  multiple  algebra  we  should  have 
an  equation  of  precisely  the  same  form  as  (8)  to  express  the  same 
relation  between  the  planes,  but  that  the  equation  would  be  explained 
to  the  student  in  a  totally  different  manner.  This  we  shall  see  more 
particularly  hereafter. 

It  is  curious  that  we  have  thus  a  simpler  notation  for  a  plane  than 
for  a  point.  This  however  may  be  reversed.  If  we  commence  with 
the  notion  of  the  coordinates  of  a  plane,  £,  9,  f,  the  equation  of  a 
point  (i.e.,  the  equation  between  £  9,  £  which  will  hold  for  every 
plane  passing  through  the  point)  will  be 

where  xt  y,  z  are  the  coordinates  of  the  point    Now  if  we  set 

q=x£+yn+zl  (6) 

we  may  regard  the  single  letter  q  as  representing  the  point,  and  use 
it,  in  many  cases,  instead  of  the  coordinates  x,  y,  zt  which  indeed  it 
implicitly  contains.    Thus  we  may  write 

<»=i*f  <7) 

for  the  three  equations 

w^+-r.  jr-^t*;   *»J-+£-        (8) 

Here,  by  an  analytical  artifice,  we  come  to  equations  identical  in 
form  and  meaning  with  those  used  by  Hamilton,  Orassmann,  and  even 
by  Mtibius  in  1827.  But  the  explanations  of  the  formal©  would 
differ  widely.  The  methods  of  the  founders  of  multiple  algebra  are 
characterized  by  a  bold  simplicity,  that  of  the  modern  geometry  by  a 
somewhat  bewildering  ingenuity.  That  p  and  q  represent  the  same 
expression  (in  one  case  x,  y,  z,  and  in  the  other  £  9,  f  being  indeter- 
minate) is  a  circumstance  which  may  easily  become  perplexing.  I  am 
not  quite  certain  that  it  would  be  convenient  to  use  both  of  these 
abridged  notations  at  the  same  time.  In  fact,  if  the  geometer  using 
these  methods  were  asked  to  express  by  an  equation  in  p  and  q  that 
the  point  q  lies  in  the  plane  p,  he  might  find  himself  somewhat 
entangled  in  the  meshes  of  his  own  ingenuity,  and  need  some  new 
artifice  to  extricate  himself.  I  do  not  mean  that  his  genius  might 
not  possibly  be  equal  to  the  occasion,  but  I  do  mean  very  seriously 
that  it  is  a  vicious  method  which  requires  any  ingenuity  or  any 
artifice  to  express  so  simple  a  relation. 

If  we  use  the  methods  of  multiple  algebra  which  are  most  com- 
parable to  those  just  described,  a  point  is  naturally  represented  by  a 
vector  (/>)  drawn  to  it  from  the  origin,  a  plane  by  a  vector  (cr)  drawn 
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from  the  origin  perpendicularly  toward  the  plane  and  in  length  equal 
to  the  reciprocal  of  the  distance  of  the  plane  from  the  origin.  The 
equation  / ,    // 

*  ■"" r~ 

will  have  precisely  the  same  meaning  as  equation  (3),  and 

,»'=£+£"  (10) 

will  have  precisely  the  same  meaning  as  equation  (7),  vis.,  that  the 
point  ft"  is  in  the  middle  between  p  and  p".  That  the  point  p  lies  in 
the  plane  <r  is  expressed  by  equating  to  unity  the  product  of  p  and  a- 
called  by  Qrassmann  internal,  or  by  Hamilton  called  the  scalar  part 
of  the  product  taken  negatively.  By  whatever  name  called,  the 
quantity  in  question  is  the  product  of  the  lengths  of  the  vectors  and 
the  cosine  of  the  included  angle.  It  is  of  course  immaterial  what 
particular  sign  we  use  to  express  this  product,  as  whether  we  write 

/<Mr=l,        or        #pa=— l.  (II) 

I  should  myself  prefer  the  simplest  possible  sign  for  so  simple  a 
relation.  It  may  be  observed  that  p  and  <r  may  be  expressed  as  the 
geometrical  sum  of  their  components  parallel  to  a  set  of  perpendicular 
axes,  via.. 

p~xi+yj+zk>        cr=£i+V+{*.  (12) 

By  substitution  of  these  values,  equation  (11)  becomes  by  the  lawB  of 
this  kind  of  multiplication 

My  object  in  going  over  these  elementary  matters  is  to  call  attention 
to  the  very  roundabout  way  in  which  the  ordinary  analysis  makes 
out  to  represent  a  point  or  a  plane  by  a  single  letter,  as  distinguished 
from  the  directness  and  simplicity  of  the  notations  of  multiple  algebra, 
and  also  to  the  fact  that  the  representations  of  points  and  planes  by 
single  letters  in  the  ordinary  analysis  are  not,  when  obtained,  as 
amenable  to  analytical  treatment  as  are  the  notations  of  multiple 
algebra. 

I  have  compared  that  form  of  the  ordinary  analysis  which  relates 
to  Cartesian  axes  with  a  vector  analysis.  But  the  case  is  essentially 
the  same  if  we  compare  the  form  of  ordinary  analysis  which  relates 
to  a  fundamental  tetrahedron  with  Orassmann's  geometrical  analysis, 
founded  on  the  point  as  the  elementary  quantity. 

In  the  method  of  ordinary  analysis  a  point  is  represented  by  four 
coordinates,  of  which  each  represents  the  distance  of  the  point  from 
a  plane  of  the  tetrahedron  divided  by  the  distance  of  the  opposite 
vertex  from  the  same  plane.  The  equation  of  a  plane  may  be  put 
in  the  form 

£*+*y+{*+«H0-O,  (14) 
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where  £,  9,  £, «»  are  the  distances  of  the  plane  from  the  four  points, 
and  xy  y,  ztw  are  the  coordinates  of  any  point  in  the  plane.  Here 
we  may  set 

p=€x+tiy+{z+<*u>,  (15) 

and  say  that  p  represents  the  plane.  To  some  extent  we  can  intro- 
duce this  letter  into  equations  instead  of  £  tj,  {,  «.    Thus  the  equation 

lp'+mp"+np'"=Q  (16) 

(which  denotes  that  the  planes  p't  p"t  p'">  meet  in  a  common  line, 
making  angles  of  which  the  sines  are  proportional  to  lt  m,  and  n)  is 
equivalent  to  the  four  equations 

lf+m£*+*f'a'0>        V+m^+n^T-0,    etc.  (17) 

Again,  we  may  regard  £  9,  f  ,  *  as  the  coordinates  of  a  plane.  The 
equation  of  a  point  will  then  be 

*£+y*+*f+wt»«0.  (18) 
If  we  set 

q*=x(+yn+z£+w<*>  (19) 

we  may  say  that  q  represents  the  point    The  equation 

q»'=i+£,  (20) 

which  indicates  that  the  point  g"'  bisects  the  line  between  qf  and  q", 
is  equivalent  to  the  four  equations 

f-£±£',        t-'tSf,  etc  (21) 

To  express  that  the  point  q  lies  in  the  plane  p  does  not  seem  easy, 
without  going  back  to  the  use  of  coordinates. 

The  form  of  multiple  algebra  which  is  to  be  compared  to  this  is  the 
geometrical  algebra  of  Mttbius  and  Qrassmann,  in  which  points  without 
reference  to  any  origin  are  represented  by  single  letters,  say  by  Italic 
capitals,  and  planes  may  also  be  represented  by  single  letters,  say  by 
Greek  capitals.    An  equation  like 

<?»=£££,  (22) 

has  exactly  the  same  meaning  as  equation  (20)  of  ordinary  algebra.    So 

m'+mlT+ttir-O  (23) 

has  precisely  the  same  meaning  as  equation  (16)  of  ordinary  algebra. 
That  the  point  Q  lies  in  the  plane  II  is  expressed  by  equating  to  zero 
the  product  of  Q  and  II  which  is  called  by  Qrassmann  external  and 
which  might  be  defined  as  the  distance  of  the  point  from  the  plane. 
We  may  write  this 

QxH  =  0.  (24) 
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To  show  that  so  simple  an  expression  is  really  amenable  to  analytical 
treatment,  I  observe  that  Q  may  be  expressed  in  terms  of  any  four 
points  (not  in  the  same  plane)  on  the  baiycentric  principle  explained 
above,  viz.,  Q=mA+yB+zC+wD9  (25) 

and  II  may  be  expressed  in  terms  of  combinatorial  products  of 
A,  B,  C,  and  D,  via., 

and  by  these  substitutions,  by  the  laws  of  the  combinatorial  product 
to  be  mentioned  hereafter,  equation  (24)  is  transformed  into 

wm+x£+yn+zt=0,  (27) 

which  is  identical  with  the  formula  of  ordinary  analysis.* 

I  have  gone  at  length  into  this  very  simple  point  in  order  to 
illustrate  the  fact,  which  I  think  is  a  general  one,  that  the  modern 
geometry  is  not  only  tending  to  results  which  are  appropriately 
expressed  in  multiple  algebra,  but  that  it  is  actually  striving  to  clothe 
itself  in  forms  which  are  remarkably  similar  to  the  notations  of 
multiple  algebra,  only  less  simple  and  general  and  far  less  amenable 
to  analytical  treatment,  and  therefore,  that  a  certain  logical  necessity 
calls  for  throwing  off  the  yoke  under  which  analytical  geometry  has 
so  long  labored  And  lest  this  should  seem  to  be  the  utterance  of  an 
uninformed  enthusiasm,  or  the  echoing  of  the  possibly  exaggerated 
claims  of  the  devotees  of  a  particular  branch  of  mathematical  study, 
I  will  quote  a  sentence  from  Clebsch  and  one  from  Clifford,  relating 
to  the  past  and  to  the  future  of  multiple  algebra.  The  former  in 
his  eulogy  on  Pltieker,t  in  1871,  speaking  of  recent  advances  in 
geometry,  says  that  "in  a  certain  sense  the  coordinates  of  a  straight 
•  line,  and  in  general  a  great  part  of  the  fundamental  conceptions  jof 
the  newer  algebra,  are  contained  in  the  Auadehnungslehre  of  1844," 
and  Clifford  J  in  the  last  year  of  his  life,  speaking  of  the  Auadehn- 
wngdehn,  with  which  he  had  but  recently  become  acquainted, 
expresses  "his  profound  admiration  of  that  extraordinary  work,  and 
his  conviction  that  its  principles  will  exercise  a  vast  influence  upon 
the  future  of  mathematical  science.1' 

Another  subject  in  which  we  find  a  tendency  toward  the  forms 
and  methods  of  multiple  algebra,  is  the  calculus  of  operations.  Our 
ordinary  analysis  introduces  operators,  and  the  successive  operations 
A  and  B  may  be  equivalent  to  the  operation  (7.  To  express  this  in 
an  equation  we  may  write 
BA(x)=C(x)t 

•The  letters  £,  9,  f,  «*,  here  denote  the  distances  of  the  plane  II  from  the  points 
Ay  Bt  Of  D,  divided  by  six  times  the  volume  of  the  tetrahedron,  A,  B,  O,  D.  The 
letters  x,  y,  z,  w,  denote  the  tetrahedral  coordinates  as  above. 

+  G0U.  Abhandl.,  vol.  xvi,  p.  28.  %Amer.  Sown.  Math.,  vol.  i,  p.  300. 
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where  x  is  any  quantity  or  function.  We  may  also  have  occasion  to 
^^  A(x)+B(x)  =  D{x)t    or    (A+B)(x)  =  D(x). 

But  it  is  almost  impossible  to  resist  the  tendency  to  express  these 
relations  in  the  form  ^  =  C 

A+B  =  Dt 

in  which  the  operators  appear  in  a  sense  as  quantities,  Le.,  as  subjects 
of  functional  operation.  Now  since  these  operators  are  often  of  such 
nature  that  they  cannot  be  perfectly  specified  by  a  single  numerical 
quantity,  when  we  treat  them  as  quantities  they  must  be  regarded  as 
multiple  quantities.  In  this  way  certain  formula)  which  essentially 
belong  to  multiple  algebra  get  a  precarious  footing  where  they  are 
only  allowed  because  they  are  regarded  as  abridged  notations  for 
equations  in  ordinary  algebra.  Tet  the  logical  development  of  such 
notations  would  lead  a  good  way  in  multiple  algebra,  and  doubtless 
many  investigators  have  entered  the  field  from  this  side. 

One  might  also  notice,  to  show  how  the  ordinary  algebra  is  be- 
coming saturated  with  the  notions  and  notations  which  seem  destined 
to  turn  it  into  a  multiple  algebra,  the  notation  so  common  in  the 
higher  algebra  (a,  b,  c)(x,  y,  z) 

for  ax+by+cz. 

This  is  evidently  the  same  as  Qrassmann's  internal  product  of  the 
multiple  quantities  (a,  6,  c)  and  (xt  y,  z),  or,  in  the  language  of 
quaternions,  the  scalar  part,  taken  negatively,  of  the  product  of  the 
vectors  of  which  a,  6,  c  and  x,  y,  z  are  the  components.  A  similar 
correspondence  with  Qrassmann's  methods  might,  I  think,  be  shown 
in  such  notations  as,  for  example, 

(a,  6,  c,  d)(xf  yf. 

The  free  admission  of  such  notations  is  doubtless  due  to  the  fact  that 
they  are  regarded  simply  as  abridged  notations. 

The  author  of  the  celebrated  "Memoir  on  the  Theory  of  Matrices" 
goes  much  farther  than  this  in  his  use  of  the  forms  of  multiple 
algebra.  Thus  he  writes  explicitly  one  equation  to  stand  for  several, 
without  the  use  of  any  of  the  analytical  artifices  which  have  been 
mentioned.  This  work  has  indeed,  as  we  have  seen,  been  characterized 
as  marking  the  commencement  of  multiple  algebra, — a  view  to  which 
we  can  only  take  exception  as  not  doing  justice  to  earlier  writers. 

But  the  significance  of  this  memoir  with  regard  to  the  point  which 
I  am  now  considering  is  that  it  shows  that  the  chasm  so  marked 
in  the  second  quarter  of  this  century  is  destined  to  be  closed  up. 
Notions  and  notations  for  which  a  Cayley  is  sponsor  will  not  be 
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excluded  from  good  society  among  mathematicians.  And  if  we  admit 
as  suitable  the  notations  used  in  this  memoir  (where  it  is  noticeable 
that  the  author  rather  avoids  multiple  algebra,  and  only  uses  it  very 
sparingly),  we  shall  logically  be  brought  to  use  a  great  deal  more. 
For  example,  if  it  id  a  good  thing  to  write  in  our  equations  a  single 
letter  to  represent  a  matrix  of  n%  numerical  quantities,  why  not  use 
a  single  letter  to  represent  the  n  quantities  operated  upon,  as  Qrass- 
mann  and  Hamilton  have  done?  Logical  consistency  seems  to  de- 
mand it  And  if  we  may  use  the  sign  X  to  denote  an  operation  by 
which  two  seta  of  quantities  are  combined  to  form  a  third  set,  as  is 
the  case  in  this  memoir,  why  not  use  other  signs  to  denote  other 
functional  operations  of  which  the  result  is  a  multiple  quantity? 
If  it  be  conceded  that  this  is  the  proper  method  to  follow  where 
simplicity  of  conception,  or  brevity  of  expression,  or  ease  of  trans- 
formation is  served  thereby,  our  algebra  will  become  in  large  part  a 
multiple  algebra. 

We  have  considered  the  subject  a  good  while  from  the  outside ;  we 
have  glanced  at  the  principal  events  in  the  history  of  multiple  algebra; 
we  have  seen  how  the  course  of  modern  thought  seems  to  demand  its 
aid,  how  it  is  actually  leaning  toward  it,  and  beginning  to  adopt  its 
methods.  It  may  be  worth  while  to  direct  our  attention  more 
critically  to  multiple  algebra  itself,  and  inquire  into  its  essential 
character  and  its  most  important  principles. 

I  do  not  know  that  anything  useful  or  interesting,  which  relates 
to  multiple  quantity,  and  can  be  symbolically  expressed,  falls  outside 
of  the  domain  of  multiple  algebra.  But  if  it  is  asked,  what  notions 
are  to  be  regarded  as  fundamental,  we  must  answer,  here  as  else- 
where, those  which  are  most  simple  and  fruitful  Unquestionably,  no 
relations  are  more  so  than  those  which  are  known  by  the  names 
of  addition  and  multiplication. 

Perhaps  I  should  here  notice  the  essentially  different  manner  in 
which  the  multiplication  of  multiple  quantities  has  been  viewed 
by  different  writers.  Some,  as  Hamilton,  or  De  Morgan,  or  Peirce, 
speak  of  the  product  of  two  multiple  quantities,  as  if  only  one  product 
could  exist,  at  least  in  the  same  algebra.  Others,  as  Qrassmann,  speak 
of  various  kinds  of  products  for  the  same  multiple  quantities.  Thus 
Hamilton  seems  for  many  years  to  have  agitated  the  question,  what 
he  should  regard  as  the  product  of  each  pair  of  a  set  of  triplets,  or 
in  the  geometrical  application  of  the  subject,  what  he  should  regard 
as  the  product  of  each  pair  of  a  system  of  perpendicular  directed 
lines*  Qrassmann  asks,  What  products,  ie.,  what  distributive 
functions  of  the  multiple  quantities,  are  most  important? 

•PkiL  Mag.  (S),  roL  xxr,  p.  4SS;  North  Brituh  RevUw,  vol  xlv,  (1S6S),  p.  07. 
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It  may  be  that  in  some  cases  the  fact  that  only  one  kind  of  product 
is  known  in  ordinary  algebra  has  led  those  to  whom  the  problem 
presented  itself  in  the  form  of  finding  a  new  algebra  to  adopt  this 
characteristic  derived  from  the  old  Perhaps  the  reason  lies  deeper 
in  a  distinction  like  that  in  arithmetic  between  concrete  and  abstract 
numbers  or  quantities.  The  multiple  quantities  corresponding  to 
concrete  quantities  such  as  ten  apples  or  three  miles  are  evidently 
such  combinations  as  ten  apples  +  seven  oranges,  three  miles  north- 
ward +  five  miles  eastward,  or  six  miles  in  a  direction  fifty  degrees 
east  of  north.  Such  are  the  fundamental  multiple  quantities  from 
Grassmann's  point  of  view.  But  if  we  ask  what  it  is  in  multiple 
algebra  which  corresponds  to  an  abstract  number  like  twelve,  which 
is  essentially  an  operator,  which  changes  one  mile  into  twelve  miles, 
and  SI  ,000  into  SI  2,000,  the  most  general  answer  would  evidently 
be:  an  operator  which  will  work  such  changes  as,  for  example,  that 
of  ten  apples  +  seven  oranges  into  fifty  apples  +  100  oranges,  or  that 
of  one  vector  into  another. 

Now  an  operator  has,  of  course,  one  characteristic  relation,  vie.,  its 
relation  to  the  operand.  This  needs  no  especial  definition,  since  it 
is  contained  in  the  definition  of  the  operator.  If  the  operation 
is  distributive,  it  may  not  inappropriately  be  called  multiplication, 
and  the  result  is  fwr  excellence  the  product  of  the  operator  and 
operand.  The  sum  of  operators  qud  operators,  is  an  operator  which 
gives  for  the  product  the  sum  of  the  products  given  by  the  operators 
to  be  added.  The  product  of  two  operators  is  an  operator  which  is 
equivalent  to  the  successive  operations  of  the  factors.  This  multi- 
plication is  necessarily  associative,  and  its  definition  is  not  really 
different  from  that  of  the  operators  themselves.  And  here  I  may 
observe  that  Professor  C.  S.  Peirce  has  shown  that  his  father's 
associative  algebras  may  be  regarded  as  operational  and  matricular.* 

Now  the  calculus  of  distributive  operators  is  a  subject  of  great 
extent  and  importance,  but  Grassmann's  view  is  the  more  compre- 
hensive, since  it  embraces  the  other  with  something  besides.  For 
every  quantitative  operator  may  be  regarded  as  a  quantity,  i.e.,  as 
the  subject  of  mathematical  operation,  but  every  quantity  cannot 
be  regarded  as  an  operator ;  precisely  as  in  grammar  every  verb  may 
be  taken  as  substantive,  as  in  the  infinitive,  while  every  substantive 
does  not  give  us  a  verb. 

Grassmann's  view  seems  also  the  most  practical  and  convenient 
For  we  often  use  many  functions  of  the  same  pair  of  multiple 
quantities,  which  are  distributive  with  respect  to  both,  and  we  need 
some  simple  designation  to  indicate  a  property  of  such  fundamental 


*  Amtr.  Jowth.  Math.,  vol.  iv,  p.  221. 
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importance  in  the  algebra  of  such  functions,  and  no  advantage 
appears  in  singling  out  a  particular  function  to  be  alone  called 
the  product  Even  in  quaternions,  where  Hamilton  speaks  of 
only  one  product  of  two  vectors  (regarding  it  as  a  special  case 
of  the  product  of  quaternions,  Le.,  of  operators),  he  nevertheless 
comes  to  use  the  scalar  part  of  this  product  and  the  vector  part 
separately.  Now  the  distributive  law  is  satisfied  by  each  of  these, 
which  therefore  may  conveniently  be  called  products.  In  this 
sense  we  have  three  kinds  of  products  of  vectors  in  Hamilton's 
analysis. 

Let  us  then  adopt  the  more  general  view  of  multiplication,  and  call 
any  function  of  two  or  more  multiple  quantities,  which  is  distributive 
with  respect  to  all,  a  product,  with  only  this  limitation,  that  when 
one  of  the  factors  is  simply  an  ordinary  algebraic  quantity,  its  effect 
is  to  be  taken  in  the  ordinary  sense. 

It  is  to  be  observed  that  this  definition  of  multiplication  implies 
that  we  have  an  addition  both  of  the  kind  of  quantity  to  which  the 
product  belongs,  and  of  the  kinds  to  which  the  factors  belong.  Of 
course,  these  must  be  subject  to  the  general  formal  laws  of  addition. 
I  do  not  know  that  it  is  necessary  for  the  purposes  of  a  general 
discussion  to  stop  to  define  these  operations  more  particularly,  either 
on  their  own  account  or  to  complete  the  definition  of  multiplication. 
Algebra,  as  a  formal  science,  may  rest  on  a  purely  formal  foundation. 
To  take  our  illustration  again  from  mechanics,  we  may  say  that 
if  a  man  is  inventing  a  particular  machine, — a  sewing  machine,  a 
reaper, — nothing  is  more  important  than  that  he  should  have  a  precise 
idea  of  the  operation  which  his  machine  is  to  perform,  yet  when  he  is 
treating  the  general  principles  of  mechanics  he  may  discuss  the  lever, 
or  the  form  of  the  teeth  of  wheels  which  will  transmit  uniform 
motion,  without  inquiring  the  purpose  to  which  the  apparatus  is  to 
be  applied ;  and  in  like  manner  that  if  we  were  forming  a  particular 
algebra, — a  geometrical  algebra,  a  mechanical  algebra,  an  algebra 
for  the  theory  of  elimination  and  substitution,  an  algebra  for  the 
study  of  quantics, — we  should  commence  by  asking,  What  are  the 
multiple  quantities,  or  sets  of  quantities,  which  we  have  to  consider  ? 
What  are  the  additive  relations  between  them  ?  What  are  the  multi- 
plicative relations  between  them?  etc,  forming  a  perfectly  defined 
and  complete  idea  of  these  relations  as  we  go  along-;  but  in  the 
development  of  a  general  algebra  no  such  definiteness  of  conception 
is  requisite.  Qiven  only  the  purely  formal  law  of  the  distributive 
character  of  multiplication, — this  is  sufficient  for  the  foundation  of  a 
science.  Nor  will  such  a  science  be  merely  a  pastime  for  an  ingenious 
mind.  It  will  serve  a  thousand  purposes  in  the  formation  of  parti- 
cular algebras.     Perhaps  we  shall  find  that  in  the  most  important 
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cases  the  particular  algebra  is  little  more  than  an  application  or 
interpretation  of  the  general. 

Grassmann  observes  that  any  kind  of  multiplication  of  n-fold 
quantities  is  characterized  by  the  relations  which  hold  between  the 
products  of  n  independent  unite.  In  certain  kinds  of  multiplication 
these  characteristic  relations  will  hold  true  of  the  products  of  any 
of  the  quantities. 

Thus  if  the  value  of  a  product  is  independent  of  the  order  of  the 
factors  when  these  belong  to  the  system  of  units,  it  will  always  be 
independent  of  the  order  of  the  factors.  The  kind  of  multiplication 
characterized  by  this  relation  and  no  other  between  the  products  is 
called  by  Grassmann  algebraic,  because  its  rules  coincide  with  those 
of  ordinary  algebra.  It  is  to  be  observed,  however,  that  it  gives 
rise  to  multiple  quantities  of  higher  orders.    If  n  independent  units 

11.4-1 

are  required  to  express  the  original  quantities,  n— 3—  units  will  be 

required  for  the  products  of  two  factors,  n- g^g for  the 

products  of  three  factors,  etc. 

Again,  if  the  value  of  a  product  of  factors  belonging  to  a  system 

of  units  is  multiplied  by  —1  when  two  factors  change  places,  the 

same  Till  be  true  of  the  product  of  any  factors  obtained  by  addition 

of  the  unite.    The  kind  of  multiplication  characterized  by  this  relation 

and  no  other  is  called  by  Grassmann  external  or  cvmMnatorial.    For 

our  present  purpose  we  may  denote  it  by  the  sign  x .    It  gives  rise 

n— 1 
to  multiple  quantities  of  higher  orders,  n  —5—  units  being  required 

to  express  the  products  of  two  factors,  n^   ~     ^  —  -  units  for 

products  of  three  factors,  etc.  All  products  of  more  than  n  factors 
are  zero.  The  products  of  n  factors  may  be  expressed  by  a  single 
unit,  viz.,  the  product  of  the  n  original  units  taken  in  a  specified 
order,  which  is  generally  set  equal  to  1.  The  products  of  n— 1  factors 
are  expressed  in  terms  of  n  units,  those  of  n— 2  factors  in  terms  of 

n  —5 —  units,  etc    This  kind  of  multiplication  is  associative,  like  the 

algebraic 

Grassmann  observes,  with  respect  to  binary  products,  that  these 
two  kinds  of  multiplication  are  the  only  kinds  characterized  by  laws 
which  are  the  same  for  any  factors  as  for  particular  units,  except 
indeed  that  characterized  by  no  special  laws,  and  that  for  which  all 
products  are  zero*  The  last  we  may  evidently  reject  as  nugatory. 
That  for  which  there  are  no  special  laws,  Le.,  in  which  no  equations 

'Oelle'tSofim./.  Math.,  vol  zlix,  p.  188. 
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subsist  between  the  products  of  a  system  of  independent  units,  is  also 
rejected  by  Qrassmann,  as  not  appearing  to  afford  important  applica- 
tions. I  shall,  however,  have  occasion  to  speak  of  it,  and  shall  call 
it  the  indeterminate  product  In  this  kind  of  multiplication,  n*  units 
are  required  to  express  the  products  of  two  factors,  and  n*  units  for 
products  of  three  factors,  etc  It  evidently  may  be  regarded  as 
associative. 

Another  very  important  kind  of  multiplication  is  that  called  by 
Qrassmann  internal.  In  the  form  in  which  I  shall  give  it,  which  is 
less  general  than  Qrassmann's,  it  is  in  one  respect  the  most  simple  of 
all,  since  its  only  result  is  a  numerical  quantity.  It  is  essentially 
binary  and  characterized  by  laws  of  the  form 

i.i=l,       jV«1,        k.k=l9    etc, 
i;=0,       i-i^O,  etc, 

where  i,j,  Jfe,  etc,  represent  a  system  of  independent  units.  I  use  the 
dot  as  significant  of  this  kind  of  multiplication. 

Qrassmann  derives  this  kind  of  multiplication  from  the  com- 
binatorial by  the  following  process.  He  defines  the  complement 
(Ergftnsung)  of  a  unit  as  the  combinatorial  product  of  all  the  other 
units,  taken  with  such  a  sign  that  the  combinatorial  product  of  the 
unit  and  its  complement  shall  be  positive.  The  combinatorial  product 
of  a  unit  and  its  complement  is  therefore  unity,  and  that  of  a  unit 
and  the  complement  of  any  other  unit  is  aero.  The  internal  product 
of  two  units  is  the  combinatorial  product  of  the  first  and  the  com- 
plement of  the  second. 

It  is  important  to  observe  that  any  scalar  product  of  two  factors  of 
the  same  kind  of  multiple  quantities,  which  is  positive  when  the 
factors  are  identical,  may  be  regarded  as  an  internal  product,  Le.,  we 
may  always  find  such  a  system  of  units,  that  the  characteristic 
equations  of  the  product  will  reduce  to  the  above  form.  The  nature 
of  the  subject  may  afford  a  definition  of  the  product  independent  of 
any  reference  to  a  system  of  units.  Such  a  definition  will  then  have 
obvious  advantages.  An  important  case  of  this  kind  occurs  in 
geometry  in  that  product  of  two  vectors  which  is  obtained  by  multi- 
plying the  products  of  their  lengths  by  the  cosine  of  the  angle  which 
they  include    This  is  an  internal  product  in  Grassmann's  sense. 

Let  us  now  return  to  the  indeterminate  product,  which  I  am 
inclined  to  regard  as  the  most  important  of  all,  since  we  may  derive 
from  it  the  algebraic  and  the  combinatorial.  For  this  end  we  will 
prefix  2  to  an  indeterminate  product  to  denote  the  sum  of  all  the 
terms  obtained  by  taking  the  factors  in  every  possible  order.    Then, 

2«|)8|y, 
for    instance,   where   the    vertical    line    is  used    to    denote    the 
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indeterminate  product  *  is  a  distributive  function  of  a,  )8,  and  y.  It 
is  evidently  not  affected  by  changing  the  order  of  the  letters.  It  is, 
therefore,  an  algebraic  product  in  the  sense  in  which  the  term  has 
been  defined. 

So,  again,  if  we  prefix  2d:  to  an  indeterminate  product  to  denote 
the  sum  of  all  terms  obtained  by  giving  the  factors  every  possible 
order,  those  terms  being  taken  negatively  which  are  obtained  by  an 
odd  number  of  simple  permutations, 

2±a|j3|y, 
for  instance,  will  be  a  distributive  function  of  a,  j3,  y,  which  is 
multiplied  by  —1  when  two  of  these  letters  change  places.     It  will 
therefore  be  a  combinatorial  product. 

It  is  a  characteristic  and  very  important  property  of  an  indeter- 
minate product  that  every  product  of  all  its  factors  with  any  other 
quantities  is  also  a  product  of  the  indeterminate  product  and  the 
other  quantities.  We  need  not  stop  for  a  formal  proof  of  this  pro- 
position, which  indeed  is  an  immediate  consequence  of  the  definitions 
of  the  terms. 

These  considerations  bring  us  naturally  to  what  Qrassmann  calls 
regressive  muUiplication,  which  I  will  first  illustrate  by  a  very 
simple  example.  If  n,  the  degree  of  multiplicity  of  our  original 
quantities,  is  4,  the  combinatorial  product  of  aXjSxy  and  Sxe,  viz., 

aX)8xyX<$xe, 
is  necessarily  zero,  since  the  number  of  factors  exceeds  four.    But  if 
for  Sxe  we  set  its  equivalent 

*|t-«|«, 
we  may  multiply  the  first  factor  in  each  of  these  indeterminate  pro- 
ducts combinatorially  by  aXjSxy,  and  prefix  the  result,  which  is  a 
numerical  quantity,  as  coefficient  to  the  second  factor.    This  will  give 

(aX#Xyx£)e-(aXj8xyX£)6\ 
Now,  the  first  term  of  this  expression  is  a  product  of  aX/Sxy,  8,  and 
e,  and  therefore,  by  the  principle  just  stated,  a  product  of  aX#Xy 
and  8\e.  The  second  term  is  a  similar  product  of  aXjSxy  and  e\8. 
Therefore  the  whole  expression  is  a  product  of  aX/Sxy  and  S\e—e\8> 
that  is,  of  aXjSxy  and  8xe.  That  is,  except  in  sign,  what  Qrass- 
mann calls  the  regressive  product  of  axfixy  and  8xe. 

To  generalize  this  process,  we  first  observe  that  an  expression  of 
the  form 

2  ±aX/3\yx8, 

in  which  each  term  is  an  indeterminate  product  of  two  combinatorial 
products,  and  in  which  2±  denotes  the  sum  of  all  terms  obtained  by 

*  This  notation  must  not  be  oonfounded  with  Grftssmann's  use  of  the  rertioel  line. 
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putting  every  different  pair  of  the  letters  before  the  dividing  line,  the 
negative  sign  being  used  for  any  terms  which  may  be  obtained  by  an 
odd  number  of  simple  permutations  of  the  letters, — in  other  words, 
the  expression 

aXj8|yX3-aXy|j8x£-aXa|yXj8+|8xy|aX<$ 
-j8xa|aXy+yX£|aX)8, 

is  a  distributive  function  of  a,  /8,  y,  and  8,  which  is  multiplied  by  —  1 
when  two  of  these  letters  change  places,  and  may,  therefore,  be  regarded 
as  equivalent  to  the  combinatorial  product  aX/8xyX<$.  Now,  if  ti  =  5, 
the  combinatorial  product  of 

pX<rXr        and         aXfiXyXS 

is  zero.  But  if  we  multiply  the  first  member  of  each  of  the  above 
indeterminate  products  by  />X<tXt,  and  prefix  the  result  as  co- 
efficient to  the  second  member,  we  obtain 

(pX<rXTXaXj8)yX£-(pX(rXTXaXy)/8x$+etc., 

which  is  what  Grassmann  calls  the  regressive  product  of  />X<tXt  and 
aXfiXyxS.  It  is  easy  to  see  that  the  principle  may  be  extended  so 
as  to  give  a  regressive  product  in  any  case  in  which  the  total  number 
of  factors  of  two  combinatorial  products  is  greater  than  w.  Also,  that 
we  might  form  a  regressive  product  by  treating  the  first  of  the  given 
combinatorials  as  we  have  treated  the  second.  It  may  easily  be  shown 
that  this  would  give  the  same  result,  except  in  some  cases  with  a 
difference  of  sign.  To  avoid  this  inconvenience,  we  may  make  the 
rule,  that  whenever  in  the  substitution  of  a  sum  of  indeterminate 
products  for  a  combinatorial,  both  factors  of  theindeterminate  products 
are  of  odd  degree,  we  change  the  sign  of  the  whole  expression.  With 
this  understanding,  the  results  which  we  obtain  will  be  identical  with 
Grassmann's  regressive  product.  The  propriety  of  the  name  consists 
in  the  fact  that  the  product  is  of  less  degree  than  either  of  the  factors* 
For  the  contrary  reason,  the  ordinary  external  or  combinatorial 
multiplication  is  sometimes  called  by  Grassmann  progressive. 

Regressive  multiplication  is  associative  and  exhibits  a  very  remark- 
able analogy  with  the  progressive.  This  analogy  I  have  not  time 
here  to  develop,  but  will  only  remark  that  in  this  analogy  lies  in  its 
most  general  form  that  celebrated  principle  of  duality,  which  appears 
in  various  forms  in  geometry  and  certain  branches  of  analysis. 

To  fix  our  ideas,  I  may  observe  that  in  geometry  the  progressive 
multiplication  of  points  gives  successively  lines,  planes  and  volumes ; 
the  regressive  multiplication  of  planes  gives  successively  lines,  points 
and  scalar  quantities. 

The  indeterminate  product  affords  a  natural  key  to  the  subject  of 
matrices.    In  fact,  a  sum  of  indeterminate  products  of  the  second 
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degree  represents  n*  scalars,  which  constitute  an  ordinary  or  quadratic 
matrix ;  a  sum  of  indeterminate  products  of  the  third  degree  repre- 
sents n*  scalars,  which  constitute  a  cubic  matrix,  etc.  I  shall  confine 
myself  to  the  simplest  and  most  important  case,  that  of  quadratic 
matrices. 
An  expression  of  the  form 

a(X.p) 

being  a  product  of  a,  X,  and  p,  may  be  regarded  as  a  product  of  «|X 
and  />,  by  a  principle  already  stated.  Now  if  $  denotes  a  sum  of 
indeterminate  products,  of  second  degree,  say  a|X+#|yu+etc,  we 
may  write 

for  a(\.p)+/3(fi.p)+ete. 

This  is  like  />,  a  quantity  of  the  first  degree,  and  it  is  a  homogeneous 
linear  function  of  p.  It  is  easy  to  see  that  the  most  general  form 
of  such  a  function  may  be  expressed  in  this  way.    An  equation  like 

represents  n  equations  in  ordinary  algebra,  in  which  n  variables  are 
expressed  as  linear  functions  of  n  others  by  means  of  n2  coefficients. 

The  internal  product  of  two  indeterminate  products  may  be  defined 
by  the  equation  (a|0).(y|*)-(j8.y)a|A 

This  defines  the  internal  product  of  matrices,  as 

This  product  evidently  gives  a  matrix,  the  operation  of  which  is 
equivalent  to  the  successive  operations  of  $  and  ¥ ;  i.e., 

We  may  express  this  a  little  more  generally  by  saying  that  internal 
multiplication  is  associative  when  performed  on  a  series  of  matrices, 
or  on  such  a  series  terminated  by  a  quantity  of  the  first  degree. 

Another  kind  of  multiplication  of  binary  indeterminate  products 
is  that  in  which  the  preceding  factors  are  multiplied  combinatoriaUy, 
and  also  the  following.    It  may  be  defined  by  the  equation 

(a\\)l(PMZ(y\v)  =  aX/3xy\\xpXV. 
This  defines  a  multiplication  of  matrices  denoted  by  the  same  symbol, 

This  multiplication,  which  is  associative  and  commutative,  is  of  great 
importance  in  the  theory  of  determinants.    In  fact, 
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is  the  determinant  of  the  matrix  #.  A  lower  power,  as  the  m*,  with 
the  divisor  n(n  — 1) ...  (n— m+1)  would  express  as  multiple  quantity 
all  the  subdeterminants  of  order  m.* 

It  is  evident  that  by  the  combination  of  the  operations  of  inde- 
terminate, algebraic,  and  combinatorial  multiplication  we  obtain 
multiple  quantities  of  a  more  complicated  nature  than  by  the  use 
of  only  one  of  these  kinds  of  multiplication.  The  indeterminate 
product  of  combinatorial  products  we  have  already  mentioned.  The 
combinatorial  product  of  algebraic  products,  and  the  indeterminate 
product  of  algebraic  products,  are  also  of  great  importance,  especially 
in  the  theory  of  quantics.  These  three  multiplications,  with  the 
internal,  especially  in  connection  with  the  general  property  of  the 
indeterminate  product  given  above,  and  the  derivation  of  the  algebraic 
and  combinatorial  products  from  the  indeterminate,  which  affords  a 
generalisation  of  that  property,  give  rise  to  a  great  wealth  of  multi- 
plicative relations  between  these  multiple  quantitiea  I  say  "  wealth 
of  multiplicative  relations"  designedly,  for  there  is  hardly  any  kind 
of  relations  between  things  which  are  the  objects  of  mathematical 
study,  which  add  so  much  to  the  resources  of  the  student  as  those 
which  we  call  multiplicative,  except  perhaps  the  simpler  class  which 
we  call  additive,  and  which  are  presupposed  in  the  multiplicative. 
This  is  a  truth  quite  independent  of  our  using  any  of  the  notations 
of  multiple  algebra,  although  a  suitable  notation  for  such  relations 
will  of  course  increase  their  value. 

Perhaps,  before  closing,  I  ought  to  say  a  few  words  on  the  appli- 
cations of  multiple  algebra. 

First  of  all,  geometry,  and  the  geometrical  sciences  which  treat  of 
things  having  position  in  space,  kinematics,  mechanics,  astronomy 
physics,  crystallography,  seem  to  demand  a  method  of  this  kind,  for 
position  in  space  is  essentially  a  multiple  quantity  and  can  only  be 
represented  by  simple  quantities  in  an  arbitrary  and  cumbersome 
manner.  For  this  reason,  and  because  our  spatial  intuitions  are  more 
developed  than  those  of  any  other  class  of  mathematical  relations, 
these  subjects  are  especially  adapted  to  introduce  the  student  to  the 
methods  of  multiple  algebra.  Here,  Nature  herself  takes  us  by  the 
hand  and  leads  us  along  by  easy  steps,  as  a  mother  teaches  her  child 
to  walk.    In  the  contemplation  of  such  subjects,  Mtfbius,  Hamilton, 

*  Quadratic  matrices  may  also  be  represented  by  a  sum  of  indeterminate  products  of 
a  quantity  of  the  first  degree  with  a  combinatorial  product  of  (n-  l)st  degree,  as,  for 
example,  when  n=4,  by  a  sum  of  produots  of  the  form 

a  1 0x7x9. 

The  theory  of  suoh  matrices  is  almost  identical  with  that  of  those  of  the  other  form, 
except  that  the  external  multiplication  takes  the  place  of  the  internal,  in  the  multipli- 
cation of  the  matrices  with  eaoh  other  and  with  quantities  of  the  first  degree. 
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and  Grassmann  formed  their  algebras,  although  the  philosophical 
mind  of  the  last  was  not  satisfied  until  he  had  produced  a  system 
unfettered  by  any  spatial  relations.  It  is  probably  in  connection  with 
some  of  these  subjects  that  the  notions  of  multiple  algebra  are  most 
widely  disseminated. 

Maxwell's  Treatise  on  Electricity  and  Magnetism  has  done  so 
much  to  familiarize  students  of  physics  with  quaternion  notations, 
that  it  seems  impossible  that  this  subject  should  ever  again  be  entirely 
divorced  from  the  methods  of  multiple  algebra. 

I  wish  that  I  could  say  as  much  of  astronomy.  It  is,  I  think,  to  be 
regretted,  that  the  oldest  of  the  scientific  applications  of  mathematics, 
the  most  dignified,  the  most  conservative,  should  keep  so  far  aloof 
from  the  youngest  of  mathematical  methods;  and  standing  as  I  do 
today,  by  some  chance,  among  astronomers,  although  not  of  the  guild, 
I  cannot  but  endeavor  to  improve  the  opportunity  by  expressing  my 
conviction  of  the  advantages  which  astronomers  might  gain  by 
employing  some  of  the  methods  of  multiple  algebra.  A  very  few  of 
the  fundamental  notions  of  a  vector  analysis,  the  addition  of  vectors 
and  what  quaternionists  would  call  the  scalar  part  and  the  vector 
part  of  the  product  of  two  vectors  (which  may  be  defined  without 
the  notion  of  the  quaternion), — these  three  notions  with  some  four 
fundamental  properties  relating  to  them  are  sufficient  to  reduce 
enormously  the  labor  of  mastering  such  subjects  as  the  elementary 
theory  of  orbits,  the  determination  of  an  orbit  from  three  observations, 
the  differential  equations  which  are  used  in  determining  the  best  orbit 
from  an  indefinite  number  of  observations  by  the  method  of  least 
squares,  or  those  which  give  the  perturbations  when  the  elements  are 
treated  as  variable.  In  all  these  subjects  the  analytical  work  is 
greatly  simplified,  and  it  is  far  easier  to  find  the  best  form  for 
numerical  calculation  than  by  the  use  of  the  ordinary  analysis. 

I  may  here  remark  that  in  its  geometrical  applications  multiple 
algebra  will  naturally  take  one  of  two  principal  forms,  according  as 
vectors  or  points  are  taken  as  elementary  quantities,  i.e.,  according  as 
something  having  magnitude  and  direction,  or  something  having 
magnitude  and  position  at  a  point,  is  the  fundamental  conception. 
These  forms  of  multiple  algebra  may  be  distinguished  as  vector 
analysis  and  point  analysis.  The  former  we  may  call  a  triple,  the 
latter  a  quadruple  algebra,  if  we  determine  the  degree  of  the  algebra 
from  the  degree  of  multiplicity  of  the  fundamental  conception.  The 
former  is  included  in  the  latter,  since  the  subtraction  of  points  gives 
us  vectors,  and  in  this  way  Grassmann's  vector  analysis  is  included  in 
his  point  analysis.  Hamilton's  system,  in  which  the  vector  is  the 
fundamental  idea,  is  nevertheless  made  a  quadruple  algebra  by  the 
addition  of  ordinary  numerical  quantities.     For  practical  purposes  we 
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may  regard  Hamilton's  system  as  equivalent  to  Qrassmann's  algebra 
of  vectors.  Such  practical  equivalence  is  of  course  consistent  with 
great  differences  of  notation,  and  of  the  point  of  view  from  which  the 
subject  is  regarded. 

Perhaps  I  should  add  a  word  in  regard  to  the  nature  of  the  problems 
which  require  a  vector  analysis,  or  the  more  general  form  of  Grass- 
mann's  point  analysis.  The  distinction  of  the  problems  is  very  marked, 
and  corresponds  precisely  to  the  distinction  familiar  to  all  analysts 
between  problems  which  are  suitable  for  Cartesian  coordinates,  and 
those  which  are  suitable  for  the  use  of  tetrahedral,  or,  in  plane 
geometry,  triangular  coordinates.  Thus,  in  mechanics,  kinematics, 
astronomy,  physics,  or  crystallography,  Qrassmann's  point  analysis 
will  rarely  be  wanted.  One  might  teach  these  subjects  for  years  by  a 
vector  analysis,  and  never  perhaps  feel  the  need  of  any  of  the  notions 
or  notations  which  are  peculiar  to  the  point  analysis,  precisely  as  in 
ordinary  algebra  one  might  use  the  Cartesian  coordinates  in  teaching 
these  subjects,  without  any  occasion  for  the  use  of  tetrahedral  coor- 
dinates. I  think  of  one  exception,  which,  however,  confirms  the 
rule.  The  very  important  theory  of  forces  acting  on  a  rigid  body  is 
much  better  treated  by  point  analysis  than  by  vector  analysis,  exactly 
as  in  ordinary  algebra  it  is  much  better  treated  by  tetrahedral  coor- 
dinates than  by  Cartesian, — I  mean  for  the  purpose  of  the  elegant 
development  of  general  propositions.  A  sufficient  theory  for  the 
purposes  of  numerical  calculations  can  easily  enough  be  given  by  any 
method,  and  the  most  familiar  to  the  student  is  for  such  practical 
purposes  of  course  the  best.  On  the  other  hand,  the  projective  pro- 
perties of  bodies,  the  relations  of  collinearity,  and  similar  subjects, 
seem  to  demand  the  point  analysis  for  their  adequate  treatment. 

If  I  have  said  that  the  algebra  of  vectors  is  contained  in  the  algebra 
of  points,  it  does  not  follow  that  in  a  certain  sense  the  algebra  of 
points  is  not  deducible  from  the  algebra  of  vectors.  In  mathematics, 
a  part  often  contains  the  whole.  If  we  represent  points  by  vectors 
drawn  from  a  common  origin,  and  then  develop  those  relations 
between  such  vectors  representing  points,  which  are  independent  of 
the  position  of  the  origin, — by  this  simple  process  we  may  obtain  a 
large  part,  possibly  all,  of  an  algebra  of  points.  In  this  way  the 
vector  analysis  may  be  made  to  serve  very  conveniently  for  many  of 
those  subjects  which  I  have  mentioned  as  suitable  for  point  analysis. 
The  vector  analysis,  thus  enlarged,  is  hardly  to  be  distinguished  from 
a  point  analysis,  but  the  treatment  of  the  subject  in  this  way  has 
somewhat  of  a  makeshift  character,  as  distinguished  from  the  unity 
and  simplicity  of  the  subject  when  developed  directly  from  the  idea  of 
something  situated  at  a  point. 

Of  those  subjects  which  have  no  relations  to  space,  the  elementary 
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theory  of  eliminations  and  substitutions,  including  the  theory  of 
matrices  and  determinants,  seems  to  afford  the  most  simple  application 
of  multiple  algebra.  I  have  already  indicated  what  seems  to  me  the 
appropriate  foundation  for  the  theory  of  matrices.  The  method  is 
essentially  that  which  Orassmann  has  sketched  in  his  first  Ausdehn- 
ungdehre  under  the  name  of  the  open  product  and  has  developed 
at  length  in  the  second. 

In  the  theory  of  quantics  Grassmann's  algebraic  product  finds  an 
application,  the  quantic  appearing  as  a  sum  of  algebraic  products  in 
Grassmann's  sense  of  the  term.  As  it  has  been  stated  that  these 
products  are  subject  to  the  same  laws  as  the  ordinary  products  of 
algebra,  it  may  seem  that  we  have  here  a  distinction  without  an 
important  difference.  If  the  quantics  were  to  be  subject  to  no  farther 
multiplications,  except  the  algebraic  in  Grassmann's  sense,  such  an 
objection  would  be  valid.  But  quantics  regarded  as  sums  of  algebraic 
products,  in  Grassmann's  sense,  are  multiple  quantities  and  subject  to 
a  great  variety  of  other  multiplications  than  the  algebraic,  by  which 
they  were  formed.  Of  these  the  most  important  are  doubtless  the 
combinatorial,  the  internal,  and  the  indeterminate.  The  combinatorial 
and  the  internal  may  be  applied,  not  only  to  the  quantic  as  a  whole  or 
to  the  algebraic  products  of  which  it  consists,  but  also  to  the  indi- 
vidual factors  in  each  term,  in  accordance  with  the  general  principle 
which  has  been  stated  with  respect  to  the  indeterminate  product  and 
which  will  apply  also  to  the  algebraic,  since  the  algebraic  may  be 
regarded  as  a  sum  of  indeterminate  product*. 

In  the  differential  and  integral  calculus  it  is  often  advantageous 
to  regard  as  multiple  quantities  various  sets  of  variables,  especially 
the  independent  variables,  or  those  which  may  be  taken  as  such. 
It  is  often  convenient  to  represent  in  the  form  of  a  single  differential 
coefficient,  as 

dr 

<*p 

a  block  or  matrix  of  ordinary  differential  coefficients.  In  this 
expression,  p  may  be  a  multiple  quantity  representing  say  n  inde- 
pendent variables,  and  t  another  representing  perhaps  the  same 
number  of  dependent  variables.  Then  dp  represents  the  n  differ- 
entials of  the  former,  and  dr  the  n  differentials  of  the  latter.  The 
whole  expression  represents  an  operator  which  turns  dp  into  dr, 
so  that  we  may  write  identically 

dr=£dp. 

Here  we  see  a  matrix  of  n*  differential  coefficients  represented  by 
a  quotient.  This  conception  is  due  to  Grassmann,  as  well  as  the 
representation  of  the  matrix  by  a  sum  of  products,  which  we  have 
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already  considered.  It  is  to  be  observed  that  these  multiple  differ- 
ential coefficients  are  subject  to  algebraic  laws  very  similar  to  those 
which  relate  to  ordinary  differential  coefficients  when  there  is  a 
single  independent  variable,  e.g., 

dxr  dr^cUr 
3t  ap    op' 

In  the  integral  calculus,  the  transformation  of  multiple  integrals 
by  change  of  variables  is  made  very  simple  and  clear  by  the  methods 
of  multiple  algebra. 

In  the  geometrical  applications  of  the  calculus,  there  is  a  certain 
class  of  theorems,  of  which  Green's  and  Poisson's  are  the  most 
notable  examples,  which  seem  to  have  been  first  noticed  in  connection 
with  certain  physical  theories,  especially  those  of  electricity  and 
magnetism,  and  which  have  only  recently  begun  to  find  their  way 
into  treatises  on  the  calculus.  These  not  only  find  simplicity  of 
expression  and  demonstration  in  the  infinitesimal  calculus  of  multiple 
quantities,  but  also  their  natural  position,  which  they  hardly  seem 
to  find  in  the  ordinary  treatises. 

But  I  do  not  so  much  desire  to  call  your  attention  to  the  diversity 
of  the  applications  of  multiple  algebra,  as  to  the  simplicity  and 
unity  of  its  principles.  The  student  of  multiple  algebra  suddenly 
finds  himself  freed  from  various  restrictions  to  which  he  has  been 
accustomed.  To  many,  doubtless,  this  liberty  seems  like  an  invi- 
tation to  license.  Here  is  a  boundless  field  in  which  caprice  may 
riot.  It  is  not  strange  if  some  look  with  distrust  for  the  result 
of  such  an  experiment.  But  the  farther  we  advance,  the  more 
evident  it  becomes  that  this  too  is  a  realm  subject  to  law.  The 
more  we  study  the  subject,  the  more  we  find  all  that  is  most  useful 
and  beautiful  attaching  itself  to  a  few  central  principles.  We 
begin  by  studying  multiple  algebras;  we  end,  I  think,  by  studying 
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ON  THE  DETERMINATION  OF  ELLIPTIC  ORBITS  FROM 
THREE  COMPLETE  OBSERVATIONS. 

[Memoirs  of  the  National  Academy  of  Sciences,  vol.  IV.  part  n. 
pp.  79-104,  1889.] 

The  determination  of  an  orbit  from  three  complete  observations 
by  the  solution  of  the  equations  which  represent  elliptic  motion 
presents  so  great  difficulties  in  the  general  case,  that  in  the  first 
solution  of  the  problem  we  must  generally  limit  ourselves  to  the  case 
in  which  the  intervals  between  the  observations  are  not  very  long.  In 
this  case  we  substitute  some  comparatively  simple  relations  between 
the  unknown  quantities  of  the  problem,  which  have  an  approximate 
validity  for  short  intervals,  for  the  less  manageable  relations  which 
rigorously  subsist  between  these  quantities.  A  comparison  of  the 
approximate  solution  thus  obtained  with  the  exact  laws  of  elliptic 
motion  will  always  afford  the  means  of  a  closer  approximation,  and 
by  a  repetition  of  this  process  we  may  arrive  at  any  required  degree 
of  accuracy. 

It  is  therefore  a  problem  not  without  interest — it  is,  in  fact,  the 
natural  point  of  departure  in  the  study  of  the  determination  of  orbits 
— to  express  in  a  manner  combining  as  far  as  possible  simplicity  and 
accuracy  the  relations  between  three  positions  in  an  orbit  separated 
by  small  or  moderate  intervals.  The  problem  is  not  entirely  deter- 
minate, for  we  may  lay  the  greater  stress  upon  simplicity  or  upon 
accuracy ;  we  may  seek  the  most  simple  relations  which  are  sufficiently 
accurate  to  give  us  any  approximation  to  an  orbit,  or  we  may  seek 
the  most  exact  expression  of  the  real  relations,  which  shall  not  be  too 
complex  to  be  serviceable. 

Derivation  of  the  Fundamental  Equation. 

The  following  very  simple  considerations  afford  a  vector  equation, 
not  very  complex  and  quite  amenable  to  analytical  transformation, 
which  expresses  the  relations  between  three  positions  in  an  orbit 
separated  by  small  or  moderate  intervals,  with  an  accuracy  far 
exceeding  that  of  the  approximate  relations  generally  used  in  the 
determination  of  orbits. 
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If  we  adopt  such  a  unit  of  time  that  the  acceleration  due  to  the 
sun's  action  is  unity  at  a  unit's  distance,  and  denote  the  vectors* 
drawn  from  the  sun  to  the  body  in  its  three  positions  by  9^,  9tj,  9t8, 
and  the  lengths  of  these  vectors  (the  heliocentric  distances)  by  rv 
rs>  r8,  the  accelerations  corresponding  to  the  three  positions  will  be 

represented  by J,  —  ^f,  —  — §.     Now  the  motion  between  the 

ri        rt        rz 
positions  considered  may  be  expressed  with  a  high  degree  of  accuracy 
by  an  equation  of  the  form 

having  five  vector  constants.    The  actual  motion  rigorously  satisfies 
six  conditions,  viz.,  if  we  write  t8  for  the  interval  of  time  between  the 

*  Vectors,  or  directed  quantities,  will  be  represented  in  this  paper  by  German  capitals. 
The  following  notations  will  be  used  in  connection  with  them : 

The  sign  =  denotes  identity  in  direction  as  well  as  length. 

The  sign  +  denotes  geometrioal  addition,  or  what  is  called  composition  in  mechanics. 

The  sign  -  denotes  reversal  of  direction,  or  composition  after  reversal 

The  notation  8.8  denotes  the  produot  of  the  lengths  of  the  vectors  and  the  cosine  of 
the  angle  whioh  they  inolude.  It  will  be  called  the  direct  product  of  8  and  8.  If 
x,  y,  z  are  the  rectangular  components  of  8,  and  *\  y*t  %'  those  of  8, 

8.8=*r/+yy'+»\ 

8.8  may  be  written  8*  and  called  the  square  of  8. 

The  notation  8x8  will  be  used  to  denote  a  veotor  of  whioh  the  length  is  the  produot 
of  the  lengths  of  8  and  8  and  the  sine  of  the  angle  which  they  include.  Its  direction 
is  perpendicular  to  8  and  8,  and  on  that  side  on  whioh  a  rotation  from  8  to  8 
appears  counter-clookwise.  It  will  be  called  the  ehew  product  of  8  and  8.  If  the 
rectangular  components  of  8  and  8  are  x,  y,  s,  and  x*t  y*t  s',  those  of  8x8  will  be 
yz'-zy',       zx'-aa',        xy^-yxf. 

The  notation  (88$)  denotes  the  volume  of  the  parallelopiped  of  whioh  three  edges  are 
obtained  by  laying  off  the  vectors  8,  8,  and  (£  from  any  same  point,  whioh  volume  is  to 
be  taken  positively  or  negatively,  according  as  the  veotor  $  falls  on  the  side  of  the  plane 
containing  8  and  8,  on  whioh  a  rotation  from  8  to  8  appears  oounter-olookwise,  or  on 
the  other  side.  If  the  rectangular  components  of  8,  8,  and  $  are  *,  y,  t ;  x',yft%'; 
and*",y",s", 

x     y     z 
(88€)=    x*    y1    %' 
x"    y"    s" 

It  follows,  from  the  above  definitions,  that  for  any  vectors  8,  8,  and  $ 
8.8=8.8,       8x8=  -8x8, 
(88€)=(8<£8)=(<E88)=  -(8€8)=  -(€««)=  -(88(E), 
and  (88C)=8.(8x<J)=8.(«x8)=(J.(8x8) ; 

also  that  8 .  8,  8  x  8,  are  distributive  functions  of  8  and  8,  and  (88(5)  a  distributive 
funotion  of  8,  8,  and  (J,  for  example,  that  if  8=8 +2R, 

8.8=8.8+9W.8,       8x8=*x8+2Rx8,       (88<£)= (88«)+(2R8G), 
and  so  for  8  and  $. 

The  notation  (88<S)  is  identical  with  that  of  Lagrange  in  the  Mecanique  Analytigue, 
except  that  there  its  use  is  limited  to  unit  vectors.  The  signification  of  9x8  is  closely 
related  to,  but  not  identical  with,  that  of  the  notation  [r,rs]  commonly  used  to  denote 
the  double  area  of  a  triangle  determined  by  two  positions  in  an  orbit. 
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first  and  second  positions,  and  rt  for  that  between  the  second  and 
third  and  set  *«0  for  the  second  position, 

for  *«—  Tt, 


wl^alj, 

for  t-0, 

<R"9t|, 

for  t~rv 

<Fl*<jtg, 

We  may  therefore  write  with  a  high  degree  of  approximation 

-^-2C-ert©+12rt*C 

-^-2«+6t1D+12t1»«. 

From  these  six  equations  the  five  constants  It,  8,  tt,  ©,  S  may  be 
eliminated,  leaving  a  single  equation  oi  the  form 

^i(1+§)«i-(1-§)^+^i(1+§)^-0'       (1) 

where  .         Tt  ,         Ta 

^-AW+T.T.-T^ 

This  we  shall  call  our  fundamental  equation.    In  order  to  discuss 
its  geometrical  signification,  let  us  set 

ih-^(l+§)    «,«(l-$>    ".-^+§>  W 

bo  that  the  equation  will  read 

n^-njttj+ttjH,-*).  (8) 

This  expresses  that  the  vector  *,{&,  is  the  diagonal  of  a  parallelogram 
of  which  n^  and  n,^  are  sides.  If  we  multiply  by  5R,  and  by  8^, 
in  skew  multiplication,  we  get 

n^xHt-Tt^x*,-*),    -n^xHt+iS^xHt-O,  (4) 

whence  8tixXt»8tix*ia8tix8ti.  (5) 
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Our  equation  may  therefore  be  regarded  as  signifying  that  the  three 
vectors  9^,  SR,,  SR,  lie  in  one  plane,  and  that  the  three  triangles 
determined  each  by  a  pair  of  these  vectors,  and  usually  denoted  by 
[r«r«]»  [rirJ»  trirJ»  ^^  proportional  to 

Since  this  vector  equation  is  equivalent  to  three  ordinary  equations, 
it  is  evidently  sufficient  to  determine  the  three  positions  of  the  body 
in  connection  with  the  conditions  that  these  positions  must  lie  upon 
the  lines  of  sight  of  three  observations.  To  give  analytical  expression 
to  these  conditions,  we  may  write  (2^,  6,,  C,  for  the  vectors  drawn 
from  the  sun  to  the  three  positions  of  the  earth  (or,  more  exactly,  of 
the  observatories  where  the  observations  have  been  made),  fjlf  fj,,  fjt 
for  unit  vectors  drawn  in  the  directions  of  the  body,  as  observed, 
and  ft,  ft,  ft  for  the  three  distances  of  the  body  from  the  places  of 
observation.    We  have  then 

«i*«i+/h8i>    «i-^+*A.    *.*«s+P»8r  («) 

By  substitution  of  these  values  our  fundamental  equation  becomes 

where  ft,  ft,  ft,  rl9  rt,  r%  (the  geocentric  and  heliocentric  distances) 
are  the  only  unknown  quantities.  From  equations  (6)  we  also  get, 
by  squaring  both  members  in  each, 

r,»-(E,t+2(«r8,)p,+p,«>  j  W 

by  which  the  values  of  rv  r„  rt  may  be  derived  from  those  of 
ft,  ft,  ft,  or  vice  versd.  Equations  (7)  and  (8),  which  are  equivalent 
to  six  ordinary  equations,  are  sufficient  to  determine  the  six  quantities 

rv  r«»  r*>  Pi>  fit*  P»  >  or»  **  we  ^P086  *^e  ▼•hws  of  rlf  rt,  r8  in  terms 
of  ft,  ft,  ft  to  be  substituted  in  equation  (7),  we  have  a  single  vector 
equation,  from  which  we  may  determine  the  three  geocentric  distances 

Pi*  Pt>  Pi- 
It  remains  to  be  shown,  first,  how  the  numerical  solution  of  the 
equation  may  be  performed,  and  secondly,  how  such  an  approximate 
solution  of  the  actual  problem  may  furnish  the  basis  of  a  closer 
approximation. 

Solution  of  the  Fundamental  Equation. 

The  relations  with  which  we  have  to  do  will  be  rendered  a  little 
more  simple  if  instead  of  each  geocentric  distance  we  introduce  the 
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distance  of  the  body  from  the  foot  of  the  perpendicular  from  the  sun 
upon  the  line  of  sight    If  we  set 

?i-Pi+(«AX        ?i-A+<MtX        3»=/>.+(er&t)>  W 

ft,-«1i-<«W^    A^V-CM^.     ft-V-«W.      (10> 
equations  (8)  become 


»-i,«-9ri"+3»i*. 


V-Ji'+A*. 


'••-fc'+ft". 


Let  us  also  set,  for  brevity, 

€^,(i +§)<«,+*&* 


(ii) 


(12) 


5"-t' 


Then  <&v  ®,,  @t  may  be  regarded  as  functions  respectively  of 
P\>  Pt>  P%*  therefore  of  qlt  qv  qit  and  if  we  set 

dql 

and  ©«©!+©«+©„ 

we  shall  have 

dS-©'^*©"^*©'"^,. 

To  determine  the  value  of  ©',  we  get  by  differentiation 


(13) 
(14) 
(15) 


But  by  (11) 


Therefore 


*-^(i+§)ft-^j£<*+*ft> 


«9i    V 


(16) 
(17) 

(18) 


®'"=^(i+§)5.-r7^r^®. 

Now  if  any  values  of  <fr,  qif  qt  (either  assumed  or  obtained  by  a 
previous  approximation)  give  a  certain  residual  ©  (which  would  be 
zero  if  the  values  of  qv  q%t  qt  satisfied  the  fundamental  equation),  and 
we  wish  to  find  the  corrections  Aqlf  Aq%,oAqs,  which  must  be  added 
to  qlt  qv  qz  to  reduce  the  residual  to  zero,  we  may  apply  equation  (15) 
to  these  finite  differences,  and  will  have  approximately,  when  these 
differences  are  not  very  large, 

-e-CAft+GTAfc+e"^,.  (19) 
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This  gives  * 


(@®"®"') 
a« (©'©"©'") 


A«fc--(i^<20) 


From  the  corrected  values  of  qv  q%,  qt  we  may  calculate  a  new  residual  ®t 
and  from  that  determine  another  correction  for  each  of  the  quantities 

9v9t>9t- 

It  will  sometimes  be  worth  while  to  use  formula  a  little  less  simple 
for  the  sake  of  a  more  rapid  approximation.  Instead  of  equation  (19) 
we  may  write,  with  a  higher  degree  of  accuracy, 

-  ®  -  ®'A<h+ ©>fc+ ®'"Afc+ iar(A(h)*+ iZ"(Aqtf+ VC"(Aq#,  (21) 

where 

■PS.    ..  .^fr,-1).  .       B,       <!■(«■,-'), 


(22) 


It  is  evident  that  X"  is  generally  many  times  greater  than  I'  or  I'7', 
the  factor  2?s,  in  the  cade  of  equal  intervals,  being  exactly  ten  times  as 
great  as  AlBl  or  A^.  This  shows,  in  the  first  place,  that  the  accurate 
determination  of  A</s  is  of  the  most  importance  for  the  subsequent 
approximations.  It  also  shows  that  we  may  attain  nearly  the  same 
accuracy  in  writing 

-®  =  ©/Ag1+®//Aft+®///Ag8+ir/A^».  (28) 

We  may,  however,  often  do  a  little  better  than  this  without  using 
a  more  complicated  equation.  For  I'+I'"  may  be  estimated  very 
roughly  as  equal  to  \X".  Whenever,  therefore,  Aft  and  Aqt  are  about 
as  large  as  Aqv  as  is  often  the  case,  it  may  be  a  little  better  to  use  the 
coefficient  ^  instead  of  |  in  the  last  term. 

For  Aqt>  then,  we  have  the  equation 

-(®®w®')=(@/®//®///)Aft^-A(^/'®///®/)Aft,.         (24) 

(£"®'"®/)  is  easily  computed  from  the  formula 


(25) 


or©'"©')  -  -(i  -  s2i)((@'®''®'")+(8l®'"®')Y 

which  may  be  derived  from  equations  (18)  and  (22). 

*  These  equation*  are  obtained  by  taking  the  direot  product*  of  both  member*  of  the 
preceding  equation  with  €>"  x  €>"',  €>'"  x  €>',  and  €>'  x  ©",  respectively.  8ee  footnote 
on  page  119. 
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The  quadratic  equation  (24)  gives  two  values  of  the  correction  to  be 
applied  to  the  position  of  the  body.  When  they  are  not  too  large, 
they  will  belong  to  two  different  solutions  of  the  problem,  generally 
to  the  two  least  removed  from  the  values  assumed.  But  a  very  large 
value  of  Aq%  must  not  be  regarded  as  affording  any  trustworthy 
indication  of  a  solution  of  the  problem.  In  the  majority  of  cases  we 
only  care  for  one  of  the  roots  of  the  equation,  which  is  distinguished 
by  being  very  small,  and  which  will  be  most  easily  calculated  by  a 
small  correction  to  the  value  which  we  get  by  neglecting  the  quadratic 
term.* 

When  a  comet  is  somewhat  near  the  earth  we  may  make  use  of  the 
fact  that  the  earth's  orbit  is  one  solution  of  the  problem,  i.e.,  that  — />, 
is  one  value  of  Ag2,  to  save  the  trifling  labor  of  computing  the  value 
of  (£"©"'©')•  For  it  is  evident  from  the  theory  of  equations  that 
if  —  p%  and  z  are  the  two  roots, 

(@'<g"©'")  (@<g'"@') 

*~9wmiQewW)  "^=i(r>©//'®>y 

Eliminating  (X"&"<&'),  we  have 

(*-*)(®®",@/)=  -/>,*(®'©"©'"), 
whence 

1    }_    (©'©"©'") 

z~to~  (©©'"©')• 


Now  —7&PZ&&&A  ta  the  value  of  Aqv  which  we  obtain  if  we  neglect 

the  quadratic  term  in  equation  (24).    If  we  call  this  value  [A<fo],  we 
have  for  the  more  exact  value  t 

The.  quantities  Agt  and  Ag,  might  be  calculated  by  the  equations 
n«(©'®"©"')Ag1+A0r®"®"')A?t»  1 
V)  =(@'@"©'")Ag,+ A(I"@'©")  A?,*  J 


*  In  the  oase  of  Swift's  oomet  (V,  1880),  the  writer  found  by  the  quadratic  equation 
-  -247  and  -  116  for  oorreotions  of  the  assumed  geooentrio  distance  *250.  The  first  of 
these  numbers  gives  an  approximation  to  the  position  of  the  earth ;  the  seoond  to  that 
of  the  oomet,  vis.,  the  geooentrio  distanoe  *134  instead  of  the  true  value  *1888.  The 
ooeffioient  A  was  used  in  the  quadratio  equation ;  with  the  ooeffioient  \  the  approxi- 
mations  would  not  be  quite  so  good.  The  value  of  the  correction  obtained  by  negleoting 
the  quadratio  term  was  "079,  which  indicates  that  the  approximations  (in  this  very 
oritioal  oase)  would  be  quite  tedious  without  the  use  of  the  quadratio  term. 

fin  the  oase  mentioned  in  the  preceding  footnote,  from  [Aft]=  -  *W$  and  ft  3*28, 
we  get  A?s=  -  allfi6,  whioh  is  sensibly  the  same  value  as  that  obtained  by  t^\im\m<tA^g 
the  quadratio  term. 
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But  a  little  examination  will  show  that  the  coefficients  ol  Agt*  in 
these  equations  will  not  generally  have  very  different  values  from  the 
coefficient  of  the  same  quantity  in  equation  (24).  We  may  therefore 
write  with  sufficient  accuracy 

Aft-tAtfJ+Aft-fAgJ,    Ag.-tAfcJ+Ag.-tAft],        (28) 

where  [AgJ,  [AgJ,  [AgJ  denote  the  values  obtained  from  equations  (20). 
In  making  successive  corrections  of  the  distances  gl%  qv  qZi  it  will 
not  be  necessary  to  recalculate  the  values  of  ®',  ®",  ®"',  when  these 
have  been  calculated  from  fairly  good  values  of  qv  qv  qr  But  when, 
as  is  generally  the  case,  the  first  assumption  is  only  a  rude  guess,  the 
values  of  ®',  ®",  ®'"  should  be  recalculated  after  one  or  two  corrections 
of  qv  qit  qv  To  get  the  best  results  when  we  do  not  recalculate  ®', 
©",  ®'",  we  may  proceed  as  follows :  Let  ®',  ®",  ®'"  denote  the  values 
which  have  been  calculated ;  Dqv  Dqt,  Dqz,  respectively,  the  sum  of 
the  corrections  of  each  of  the  quantities  qv  qt>  gg,  which  have  been 
made  since  the  calculation  of  ®',  ®",  <&>'";  <$  the  residual  after  all  the 
corrections  of  ql$  qv  q9,  which  have  been  made;  and  Aglf  Ag2,  Agg 
the  remaining  corrections  which  we  are  seeking.  We  have,  then, 
very  nearly 

-®-{®/+r(Dg1+iAg1)}Ag1+{®//+r(2>g2+iAgt)}Agtl   (29) 
+  {®w+Iw(Dg,+iAgt)}Agr  J 

The  same  considerations  which  we  applied  to  equation  (21)  enable 
us  to  simplify  this  equation  also,  and  to  write  with  a  fair  degree  of 
accuracy 

-(6®'"®')-  {(©'©"©"O+t^'S'S^^t+iAg^lAg,,     (30) 
Aga-fAgJ+Ag^-tAgJ,    Ag.-tAgJ+Ag.-tAgJ,        (31) 

where 

rA    ,        (®®"®'")     rA„,        (®®w®')  (®®'®") 

lAg1]--^7g77g^,  LAg,J--(@^(gWy  LAgd=-((g^^y(32) 

Correction  of  the  Fundamental  Equation. 

When  we  have  thus  determined,  by  the  numerical  solution  of  our 
fundamental  equation,  approximate  values  of  the  three  positions  of 
the  body,  it  will  always  be  possible  to  apply  a  small  numerical 
correction  to  the  equation,  so  as  to  make  it  agree  exactly  with  the 
laws  of  elliptic  motion  in  a  fictitious  case  differing  but  little  from 
the  actual  After  such  a  correction  the  equation  will  evidently  apply 
to  the  actual  case  with  a  much  higher  degree  of  approximation. 

There  is  room  for  great  diversity  in  the  application  of  this  principle. 
The  method  which  appears  to  the  writer  the  most  simple  and  direct  is 
the  following,  in  which  the  correction  of  the  intervals  for  aberration 
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is  combined  with  the  correction  required  by  the  approximate  nature  of 
the  equation.* 

The  solution  of  the  fundamental  equation  gives  us  three  points, 
which  must  necessarily  lie  in  one  plane  with  the  sun,  and  in  the  lines 
of  sight  of  the  several  observations.  Through  these  points  we  may 
pass  an  ellipse,  and  calculate  the  intervals  of  time  required  by  the 
exact  laws  of  elliptic  motion  for  the  passage  of  the  body  between 
them.  If  these  calculated  intervals  should  be  identical  with  the  given 
intervals,  corrected  for  aberration,  we  would  evidently  have  the  true 
solution  of  the  problem.  But  suppose,  to  fix  our  ideas,  that  the 
calculated  intervals  are  a  little  too  long.  It  is  evident  that  if  we 
repeat  our  calculations,  using  in  our  fundamental  equation  intervals 
shortened  in  the  same  ratio  as  the  calculated  intervals  have  come  out 
too  long,  the  intervals  calculated  from  the  second  solution  of  the 
fundamental  equation  must  agree  almost  exactly  with  the  desired 
values.  If  necessary,  this  process  may  be  repeated,  and  thus  any 
required  degree  of  accuracy  may  be  obtained,  whenever  the  solution  of 
the  uncorrected  equation  gives  an  approximation  to  the  true  positions. 
For  this  it  is  necessary  that  the  intervals  should  not  be  too  great.  It 
appears,  however,  from  the  results  of  the  example  of  Ceres,  given 
hereafter,  in  which  the  heliocentric  motion  exceeds  62°  but  the 
calculated  values  of  the  intervals  of  time  differ  from  the  given  values 
by  little  more  than  one  part  in  two  thousand,  that  we  have  here  not 
approached  the  limit  of  the  application  of  our  formula. 

In  the  usual  terminology  of  the  subject,  the  fundamental  equation 
with  intervals  uncorrected  for  aberration  represents  the  first  hypothesis; 
the  same  equation  with  the  intervals  affected  by  certain  numerical 
coefficients  (differing  little  from  unity)  represents  the  second  hypothesis; 
the  third  hypothesis,  should  such  be  necessary,  is  represented  by  a 
similar  equation  with  corrected  coefficients,  etc. 

In  the  process  indicated  there  are  certain  economies  of  labor  which 
should  not  be  left  unmentioned,  and  certain  precautions  to  be  observed 
in  order  that  the  neglected  figures  in  our  computations  may  not 
unduly  influence  the  result. 

It  is  evident,  in  the  first  place,  that  for  the  correction  of  our  funda- 
mental equation  we  need  not  trouble  ourselves  with  the  position  of  the 
orbit  in  the  solar  system.  The  intervals  of  time,  which  determine  this 
correction,  depend  only  on  the  three  heliocentric  distances  rv  rv  rg  and 
the  two  heliocentric  angles,  which  will  be  represented  by  vi—vl  and 
v8— vv  if  we  write  v,,  vv  vt  for  the  true  anomalies.  These  angles 
(vi""vi  ana<  t>8~"v*)  mav  b®  determined  from  rv  r%t  rt  and  nl,  w,,  ng, 


*  When  an  approximate  orbit  is  known  in  advanoe,  we  may  oorreot  the  fundamental 
equation  at  once.    The  formulae  will  be  given  in  the  Summary,  |  xii 
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and  therefore  from  rv  rt,  r3  and  the  given  intervals.  For  our  funda- 
mental equation,  which  may  be  written 

nfa-nfa+nfa^O,  (33) 

indicates  that  we  may  form  a  triangle  in  which  the  lengths  of  the 
sides  shall  be  nYrlt  n%r%9  and  n^  (let  us  say  for  brevity,  sv  «2,  «,),  and 
the  directions  of  the  sides  parallel  with  the  three  heliocentric  directions 
of  the  body.  The  angles  opposite  8l  and  8$  will  be  respectively  v%—vt 
and  v2— vv    We  have,  therefore,  by  a  well-known  formula, 


tan 


v*~v*m    I  («i-*i+«i)(«i+«j-«i) 


3.  JJb 


2        'V(«l+«f+«8)(-«l+«2+«8) 


(34) 


As  soon,  therefore,  as  the  solution  of  our  fundamental  equation 
has  given  a  sufficient  approximation  to  the  values  of  rv  rv  r8  (say 
five-  or  six-figure  values,  if  our  final  result  is  to  be  as  exact  as 
seven-figure  logarithms  can  make  it),  we  calculate  14,  14,  14  with 
seven-figure  logarithms  by  equations  (2),  and  the  heliocentric  angles 
by  equations  (34). 

The  semi-parameter  corresponding  to  these  values  of  the  heliocentric 
distances  and  angles  is  given  by  the  equation 

p.VrVi+1l6.  (35) 

r       14-94+ tig 

The  expression  14—14+ 74,  which  occurs  in  the  value  of  the  semi- 
parameter,  and  the  expression  141^— 141*1 +141*3,  or  s^— «8+«s,  which 
occurs  both  in  the  value  of  the  semi-parameter  and  in  the  formulas  for 
determining  the  heliocentric  angles,  represent  small  quantities  of  the 
second  order  (if  we  call  the  heliocentric  angles  small  quantities  of  the 
first  order),  and  cannot  be  very  accurately  determined  from  approxi- 
mate numerical  values  of  their  separate  terms.  The  first  of  these 
quantities  may,  however,  be  determined  accurately  by  the  formula 

With  respect  to  the  quantity  8X—  8%+8t,  a  little  consideration  will  show 
that  if  we  are  careful  to  use  the  same  value  wherever  the  expression 
occurs,  both  in  the  formulas  for  the  heliocentric  angles  and  for  the 
semi-parameter,  the  inaccuracy  of  the  determination  of  this  value  from 
the  cause  mentioned  will  be  of  no  consequence  in  the  process  of 
correcting  the  fundamental  equation.  For  although  the  logarithm 
of  «!— *2+«8  as  calculated  by  seven-figure  logarithms  from  rv  rs,  rg 
may  be  accurate  only  to  four  or  five  figures,  we  may  regard  it  as 
absolutely  correct  if  we  make  a  very  small  change  in  the  value  of  one 
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of  the  heliocentric  distances  (say  rt ).  We  need  not  trouble  ourselves 
farther  about  this  change,  for  it  will  be  of  a  magnitude  which  we 
neglect  in  computations  with  seven-figure  tables.  That  the  helio- 
centric angles  thus  determined  may  not  agree  as  closely  as  they  might 
with  the  positions  on  the  lines  of  sight  determined  by  the  first  solution 
of  the  fundamental  equation  is  of  no  especial  consequence  in  the 
correction  of  the  fundamental  equation,  which  only  requires  the  exact 
fulfilment  of  two  conditions,  viz.,  that  our  values  of  the  heliocentric 
distances  and  angles  shall  have  the  relations  required  by  the  funda- 
mental equation  to  the  given  intervals  of  time,  and  that  they  shall 
have  the  relations  required  by  the  exact  laws  of  elliptic  motion  to 
the  calculated  intervals  of  time.  The  third  condition,  that  none  of 
these  values  shall  differ  too  widely  from  the  actual  values,  is  of  a 
looser  character. 

After  the  determination  of  the  heliocentric  angles  and  the  semi- 
parameter,  the  eccentricity  and  the  true  anomalies  of  the  three 
positions  may  next  be  determined,  and  from  these  the  intervals  of 
time.  These  processes  require  no  especial  notice.  The  appropriate 
formula  will  be  given  in  the  Summary  of  Formulas. 

• 

Determination  of  the  Orbit  from  the  Three  Positions  and  the 
Intervals  of  Time. 

The  values  of  the  semi-parameter  and  the  heliocentric  angles  as 
given  in  the  preceding  paragraphs  depend  upon  the  quantity  ^—Sj+Sg, 
the  numerical  determination  of  which  from  8V  «2,  and  a,  is  critical  to 
the  second  degree  when  the  heliocentric  angles  are  small.  This  was 
of  no  consequence  in  the  process  which  we  have  called  the  correction 
of  tiie  fundamental  equation.  But  for  the  actual  determination  of  the 
orbit  from  the  positions  given  by  the  corrected  equation— or  by  the 
uncorrected  equation,  when  we  judge  that  to  be  sufficient — a  more 
accurate  determination  of  this  quantity  will  generally  be  necessary. 
This  may  be  obtained  in  different  ways,  of  which  the  following  is 
perhaps  the  most  simple.    Let  us  set 

©4  =  ®s-@i>  (37) 

and  84  for  the  length  of  the  vector  ©4,  obtained  by  taking  the  square 
root  of  the  sum  of  the  squares  of  the  components  of  the  vector.  It  is 
evident  that  8%  is  the  longer  and  84  the  shorter  diagonal  of  a  parallelo- 
gram of  which  the  sides  are  8t  and  63.  The  area  of  the  triangle  having 
the  sides  811  8it  *8  is  therefore  equal  to  that  of  the  triangle  having  the 
sides  sv  «j,  80  each  being  one-half  of  the  parallelogram.    This  gives 
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***     ^^.J^$i+^^+^^^^+^(h+9j-^    (39) 

The  numerical  determination  of  this  value  of  «!—«,+«,  is  critical 
only  to  the  first  degree. 

The  eccentricity  and  the  true  anomalies  may  be  determined  in  the 
same  way  as  in  the  correction  of  the  formula.  The  position  of 
the  orbit  in  space  may  be  derived  from  the  following  considerations. 
The  vector  —  ®t  is  directed  from  the  sun  toward  the  second  position 
of  the  body;  the  vector  <S4  from  the  first  to  the  third  position. 
If  we  set 

®.-©«-^®i.  (40) 

the  vector  @5  will  be  in  the  plane  of  the  orbit,  perpendicular  to  — <S2 
and  on  the  side  toward  which  anomalies  increase.  If  we  write  «6  for 
the  length  of  ®5, 

-®f  and   ®* 

will  be  unit  vectors.  Let  3  and  3"  be  unit  vectors  determining  the 
position  of  the  orbit,  3  being  drawn  from  the  sun  toward  the  peri- 
helion, and  3"  at  right  angles  to  3,  in  the  plane  of  the  orbit,  and  on 
the  side  toward  which  anomalies  increase.    Then 

3«-coev2^-sinvt^*,  (41) 


3'= 


:-8mv3+co8i;®*.  (42) 

The  time  of  perihelion  passage  (T)  may  be  determined  from  any 
one  of  the  observations  by  the  equation 

^(t-^B-emnB,  (48) 

Or 

the  eccentric  anomaly  E  being  calculated  from  the  true  anomaly  v. 
The  interval  J— T  in  this  equation  is  to  be  measured  in  days.  A 
better  value  of  T  may  be  found  by  averaging  the  three  values  given 
by  the  separate  observations,  with  such  weights  as  the  circumstances 
may  suggest  But  any  considerable  differences  in  the  three  values 
of  T  would  indicate  the  necessity  of  a  second  correction  of  the 
formula,  and  furnish  the  basis  for  it 

For  the  calculation  of  an  ephemeris  we  have 

««-oe3+cos^a3+sin^&3'  (44) 

in  connection  with  the  preceding  equation. 

Sometimes  it  may  be  worth  while  to  make  the  calculations  for  the 
correction  of  the  formula  in  the  slightly  longer  form  indicated  for 
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the  determination  of  the  orbit  This  will  be  the  case  when  we  wish 
simultaneously  to  correct  the  formula  for  its  theoretical  imperfection, 
and  to  correct  the  observations  by  comparison  with  others  not  too 
remote.  The  rough  approximation  to  the  orbit  given  by  the  un- 
corrected formula  may  be  sufficient  for  this  purpose.  In  fact,  for 
observations  separated  by  very  small  intervals,  the  imperfection  of 
the  uncorrected  formula  will  be  likely  to  affect  the  orbit  less  than 
the  errors  of  the  observations. 

The  computer  may  prefer  to  determine  the  orbit  from  the  first  and 
third  heliocentric  positions  with  their  times.  This  process,  which  has 
certain  advantages,  is  perhaps  a  little  longer  than  that  here  given,  and 
does  not  lend  itself  quite  so  readily  to  successive  improvements  of  the 
hypothesis.  When  it  is  desired  to  derive  an  improved  hypothesis 
from  an  orbit  thus  determined,  the  formula  in  §  XII  of  the  summary 
may  be  used. 

SUMMARY  OF  FORMULAE 

WITH  DIRECTIONS  FOR  USE. 
(For  the  case  in  which  an  approximate  orbit  is  known  in  advance,  tee  XII.) 

I. 
Preliminary  amputations  relating  to  the  intervals  of  time. 
ti,t2,  tz= times  of  the  observations  in  days, 
log  k =8-2355814  (after  Gauss) 

*l  12  Dt  12  ^3-  12 

For  control :  Aflx+B%+AJ&%=±rirv 

II. 

Preliminary  computations  relating  to  the  first  observation. 

Xlf  Yv  Zx  (components  of  (g^the  heliocentric  coordinates  of  the 

earth,  increased  by  the  geocentric 
coordinates  of  the  observatory. 
€i>  tfi>  ?i  (components  of  gj)  =  the  direction-cosines  of  the  observed 

position,  corrected  for  the  aber- 
ration of  the  fixed  stars. 
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Prdiminary  computations  relating  to  the  second  and  third 
observation*. 

The  formulae  are  entirely  analogous  to  those  relating  to  the  first 
observation,  the  quantities  being  distinguished  by  the  proper  suffixes. 

in. 

Equations  of  ike  first  hypotJiesis. 

When  the  preceding  quantities  have  been  computed,  their  numerical 
values  (or  their  logarithms,  when  more  convenient  for  computation,) 
are  to  be  substituted  in  the  following  equations : 

Components  of  $, 

y1=^1U+i*1>(<71+-|| -(€,.&)) 

sl*=a1'+fr*+y1*=A1*(l  +  *i)V 

Components  of  6' 
a'-Al£1+Al£xRl-P'al' 
P-Afr+AflA-rfr    HI' 

y'-^+^A-Py,. 

Components  of  €>j 


?i=/h+(«i&) 

For  control : 

0+W 


*-3 


For  control : 


*"5 


«*— &(i-iy(ft+^-(^.w) 
A-  -*s(i-i2,)(?f+ J -(«,.&)) 
y,-  -«i  --^(ft+f  -(«.•&)) 

V=«*i+/V+y,s»(l-RJV 

Components  of  @" 
a"— &+&«,+ J%] 

0"=-*t+?A+i>"&  nr 
y'^-^+^+^y*. 

Components  of  @» 

«,=il^(i  +iy(ft+^_  («,.&)) 
&-4*<i+jy(?,+|*-<M0) 


m, 


m, 
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Components  of  ©"' 

/r-Ato+AfiA-r-fr  \  in"- 

y'"=A^+A^tRt-F"ytj 


3%, 


The  computer  is  now  to  assume  any  reasonable  values  either  of  the 
geocentric  distances,  plt  fa,  p3>  or  of  the  heliocentric  distances,  rl9  rt> 
r8  (the  former  in  the  case  of  a  comet,  the  latter  in  the  case  of  an 
asteroid),  and  from  these  assumed  values  to  compute  the  rest  of  the 
following  quantities : 


Mem  m„  nr. 

By  equations  IHg,  HI". 

By  equation*  m,,  DT 

9i 

9i 

9* 

logr, 

logrt 

logr, 

log  JJ, 

log.R, 

logfl, 

V>g(l+B0 

kg(l-jy 

log(l+iy 

log  J" 

logP" 

kgP* 

«i 

«t 

«» 

A 

A 

A 

yi 

yt 

y» 

a 

a 

a 

p 

F 

j8" 

y 

y" 

y'" 

IV. 

CalcuLatitms  vdatMig  to  differential  (^efficients. 
Components  of  $"x $'"        Components  of  ©'"x ©'        Components  of  $'x 6" 

a1=/8"y'"-y"/8"'       <h-F,-/~rP       <h=Py"-y'(r 

!=ya    —ay  ^f—y  a~<*  V  &$aya  —ay 

cx=*a  p   —pa  c2=a  p—p  a  (^^ap  —pa 

These  computations  are  controlled  by  the  agreement  of  the  three 
values  of  0. 

The  following  are  not  necessary  except  when  the  corrections  to  be 
made  are  large : 


■£(>+S)(«S)-fM 
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V. 

Corrections  of  the  geocentric  distances. 
Oomponeoto  xd  g. 

*-a+a+a   c,— sE±y±aa 

y-tt+y.+y,     ct--flg±y±tt 

(This  equation  will  generally  be  most  easily  solved  by  repeated 
substitutions.) 

VL 

Successive  corrections. 

Aql9  Aq%>  Aqt  are  to  be  added  as  corrections  to  qv  qv  q9.  With  the 
new  values  thus  obtained  the  computation  by  equations  ID^,  HI,,  HI, 
are  to  be  recommenced    Two  courses  are  now  open : 

(a)  The  work  may  be  carried  an  exactly  as  before  to  the  deter- 
mination of  new  corrections  for  qv  g2,  qv 

(b)  The  computations  by  equations  HI',  HE",  EQ"',  and  IV  may  be 
(Knitted,  and  the  old  values  of  04,  bp  Cj,  a,,  etc,  0,  and  L  may  be 
used  with  the  new  residuals  a,  /8,  y  to  get  new  corrections  for  qv  qv 
qt  by  the  equations 

Aft-C^+A},-^,        Aq9=Cs+Aqt-Ct, 

where  Dqt  denotes  the  former  correction  of  qv  (More  generally,  at 
any  stage  of  the  work,  Dqt  will  represent  the  sum  of  all  the  corrections 
of  qt  which  have  been  made  since  the  last  computation  of  04,  6lf  etc.) 

So  far  as  any  general  rule  can  be  given,  it  is  advised  to  recompute 
C4,  fcp  etc,  and  0  once,  perhaps  after  the  second  corrections  of  qv  qt> 
gs,  unless  the  assumed  values  represent  a  fair  approximation.  Whether 
L  is  also  to  be  recomputed,  depends  on  its  magnitude,  and  on  that  of 
the  correction  of  qt%  which  temains  to  be  made.  In  the  later  stages 
of  the  work,  when  the  corrections  are  small,  the  terms  containing  L 
may  be  neglected  altogether. 

The  corrections  of  qlt  q2,  q9  should  be  repeated  until  the  equations 

a»0        0=0        y-0 
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are  Dearly  satisfied.  Approximate  values  of  rt,  rs,  rs  may  suffice  for 
the  following  computations,  which,  however,  must  be  made  with  the 
greatest  exactness. 

VII. 

Test  of  the  first  hypothesis. 

logri>   logr2,    logr8   (approximate    values    from    the    preceding 
computations). 

4"iiiri+.412?1rrs 
s  =  i(«i+*i+*s) 

8mmm8i>  8^m8^t  £~»8g. 

The  value  of  *— aj  may  be  very  small,  and  its  logarithm  in  con- 
sequence ill  determined.  This  will  do  no  harm  if  the  computer  is 
careful  to  use  the  same  value — computed,  of  course,  as  carefully  as 
possible — wherever  the  expression  occurs  in  the  following  formulas : 

2(8— «-)  i.       w  \         B 

P-      jf  tanK".-*!)-  — 

,        ,,  v      8  —  8. 

tM»i(v»-"i)— ^"* 
For  adjustment  of  values:    l(vt—v1)=l(vt—vl)+l(vt—vj 

£_£ 

«  COS  iO's+^l)3*  o — Ht* v 

*v  8      17    2  cos  J  (v,—^) 
tani(V3+v,)  e* 

For  control:  ecoevt=±-— 1 


?=Vi+i 


_£_ 


tan  $^  =  6  tan  \vx        tan^2=etan  Jv2        tan  JJE,s=€tan|vs 
Tlctlc.=a*(j&j-^j)+«a*sin^1-ea*sinj&g 
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VIII. 
For  the  second  hypothesis. 
&Tx  f*  ,0057613A;(/91 — pj        (aberration-constant  after  Struve. ) 
#rs  -  -OOMeiSkifr  -  pj        log  (-0057613*)  -  5-99610 

AlogT1=logTl-log(Tl0llie.-*rl) 

AlogT3  =  logT,-log(T8  «*,-&-,) 

AlogOrjTjHAlogi^+AlogTs 
Alog^sAlogT!— AlogTg 

A  log  At  =  —  ASA  log  -1 

AlogJlg^-iljAlog^ 

Alog^-Alog^r^-^^Alog^ 

Alog^-Alog^r^+^^Alog^ 

Alog^.Alog^r^+^^Alog^ 

These  corrections  are  to  be  added  to  the  logarithms  of  At,  A9t 
Bv  Bt,  Bv  in  equations  ni^  HI,,  IIIj,  and  the  corrected  equations 
used  to  correct  the  values  of  qx>  qt,  q$,  until  the  residuals  a,  fi,  y 
vanish.  The  new  values  of  Av  At  must  satisfy  the  relation 
At+A$=lt  and  the  corrections  Alog^,  Alogil,  must  be  adjusted, 
if  necessary,  for  this  end. 

Third  hypothesis. 

A  second  correction  of  equations  inx,  HI,,  HI,  may  be  obtained  in 
the  same  manner  as  the  first,  but  this  will  rarely  be  necessary. 

IX. 
IkterrnwuUion  of  the  ellipse. 
It  is  supposed  that  the  values  of 

av  A>  Vi>        <4>  Pt»  Jt>        «$>  &>  V»» 

ri»   rV    r8»  &!>  ^t*  R*>  8l>    8V     83> 

have  been  computed  by  equations  IIIX,  IHj,  IIIg  with  the  greatest 
exactness,  so  as  to  make  the  residuals  a,  /3,  y  vanish,  and  that  the 
two  formulas  for  each  of  the  quantities  sl,  *2,  sz  give  sensibly  the  same 
value. 
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Components  of  ©4 
a4  =  at""«i 


Components  of  @6 


A-A-A  A-A-^+^+Y^A 


Vi^ys-Vi 


V-ct'+A'+ri* 


For  control  only : 


«(«-«lx»-«$) 

JT-4  A+^+-Mi  tan  K^-tg-^- 

2b»* 


R* 


The  computer  should  be  careful  to  use  the  corrected  values  of  Av  AP 
(See  VIII.)    Trifling  errors  in  the  angles  should  be  distributed. 

i\  8      1/    2co8$(ty-t>l) 

For  control:  ecoev«*=— —  1 

,1-e 


«-Vi+5    a"Ai    *-*/<<*> 

Direotion-oosines  of  semi-major  axis. 


,          C06V. 
6= <*t~ 


smv. 


?«5 


m 


_     cobv9a     sint;tA 


cos  v.        sin  v« 
»- — rvi — i~ 'v» 
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Direotion-oosinea  of  semi-minor  axis. 

sin  v.      ,  cos  tv 

Components  of  the  semi-axes. 
03=01  Oy=am  Og—an 

2>„as&\  byssbfi  bg=*bv 

X. 

Time  of  perihelion  passage. 

Corrections  for  Aberration. 

tan  JJ^  =»  <  tan  ±vx  ^  =  -  -0057618^ 

tani^2=etanjt;2  ^=  -  0057613^ 

tan  J2P, = e  tan  Jv8  ^  =  -  -0057613/^ 

log  -0057618 =7-76052 

«1+^-2T=Jfc-1a*(jF1-esin^1) 

^+Al-r->ib-1a*(jfft-e8in£rt) 

^+^-r«ib-la*(jyt-«8injyi) 

The  threefold  determination  of  T  affords  a  control  of  the  exactness 
of  the  solution  of  the  problem.  If  the  discrepancies  in  the  values  of  T 
are  such  as  to  require  another  correction  of  the  formulae  (a  third 
hypothesis),  this  may  be  based  on  the  equations 

AkgTt-Jf^Ja       AlogTS-Jf?9=5a 

where  7n))  Tw,  Tw  denote  respectively  the  values  obtained  from  the 
first,  second,  and  third  observations,  and  M  the  modulus  of  common 
logarithms. 

For  an  ephemeria. 

-M-T)=E-ennB 
a* 

Heliooen trio  coordinates.    (Components  of  9t.) 

a;=  —  eax+ctvCoaE+bxfsinE 

y=  —  coy+OyCos  E+by&iaE 

z=  —eai+agCOBE+bzsmE 

These  equations  are  completely  controlled  by  the  agreement  of  the 
computed  and  observed  positions  and  the  following  relations  between 
the  constants : 
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XIL 

When  an  approximate  orbit  is  known  in  advance,  we  may  use  it  to 
improve  our  fundamental  equation.  The  following  appears  to  be  the 
most  simple  method : 

Find  the  excentric  anomalies  Elt  E2>  2?8,  and  the  heliocentric 
distances  rv  r2,  r8,  which  belong  in  the  approximate  orbit  to  the  times 
of  observation  corrected  for  aberration. 

Calculate  Bl9  2?8,  as  in  §  I,  using  these  corrected  times. 

Determine  Av  As  by  the  equation 

gj) A(.*g) 

sm(2F8-^-esin.#8+e8in2?2    msi(Bt—E^emikB%+emnBl 

in  connection  with  the  relation  A^A^l. 
Determine  Bt  so  as  to  make 


4+5) 


^iS+S+^i 


A 


4sinJ(^-^i)8ini(^-^)8inJ(^s-^i) 
equal  to  either  member  of  the  last  equation. 

It  is  not  necessary  that  the  times  for  which  Ev  E%9  EZt  rlt  rv  r8, 
are  calculated  should  precisely  agree  with  the  times  of  observation 
corrected  for  aberration.  Let  the  former  be  represented  by  t^,  t^t  £,', 
and  the  latter  by  tx",  tt",  tf;  and  let 

AlogT^lpg^'-O-log^-V^ 

AlogTs=log(V-0-log(V-0 
We  may  find  fiv  Bs,  Av  AZt  Bv  as  above,  using  *J/,  t2't  $,'•  and  then  use 
A  log  rv  A  log  ts  to  correct  their  values,  as  in  §  VIII. 


Numerical  Example. 

To  illustrate  the  numerical  computations  we  have  chosen  the 
following  example,  both  on  account  of  the  large  heliocentric  motion, 
and  because  Gauss  and  Oppolzer  have  treated  the  same  data  by  their 
different  methods. 

The  data  are  taken  from  the  Theoria  Motus,  §  159,  viz., 


Times,  1805,  September 
Longitudes  of  Ceres     - 
Latitudes  of  Ceres 
Longitudes  of  the  Earth 
Logs  of  the  Sun's  distance 


551336 

95*  32?  18*56 

-0°50/34"-06 

342*  54' 56* -00 

0*0031514 


13912711 
99°  4V  5*87 
+  7°  16'  36" -80 
117M2'43"*25 

9*9929861 


265*39813 
118*  5' 28*85 
+r  38' 49*39 
241°  58'  50*  -71 

0*0056974 


The  positions  of  Ceres  have  been  freed  from  the  effects  of  parallax 
and  aberration. 
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I. 
From  the  given  times  we  obtain  the  following  values  : 


139 


Numbers. 

Logarithms. 

«1-*1 

188-91375 

21268252 

**-** 

125*97102 

21002706 

h-h 

259  88477 

2*4147809 

^1 

•4847187 

9-6854897 

A* 

•5152812 

9-7120443 

rx 

•8358520 

T, 

•8624066 

*i 

9-6692113 

A, 

•3183722 

*l 

95623916 

Control: 


4A+AS+4A-2-4959086 
iTlT,= 24959081 

II 


From  the  given  positions  we 

get: 

logX, 

9-9885516 

+ 

log** 

9-6531725 

_ 

log*, 

9-6775810 

~ 

logF, 

9-47U748 

- 

-   logF, 

9-9420444 

+ 

logF, 

9-9515547 

"" 

Zx 

0 

z* 

0 

z% 

0 

l°g*i 

8-9845270 

- 

log*, 

9-2282738 

- 

log*, 

9*6690294 

- 

log** 

9*9979027 

+ 

log* 

9-9900800 

+ 

log* 

9*9416855 

+ 

iogr, 

8*2387150 

- 

log*, 

91026549 

+ 

logfc 

91240813 

+ 

** 

-3874081 

- 

«■-& 

•9314223 

+ 

«.-«. 

•5599304 

- 

rt 

•8645836 

+ 

Pf 

1006681 

+ 

9f 

•7130624 

+ 

III. 

The  preceding  computations  furnish  the  numerical  values  for  the 
equations  Hllt  HI',  HI,,  III",  IIIS>  III'",  which  follow.  Brackets 
indicate  that  logarithms  have  been  substituted  for  numbers. 

We  have  now  to  assume  some  values  for  the  heliocentric  distances 
ri»  rs»  rs*  A  mean  proportional  between  the  mean  distances  of  Mars 
and  Jupiter  from  the  Sun  suggests  itself  as  a  reasonable  assumption. 
In  order,  however,  to  test  the  convergence  of  the  computations,  when 
the  assumptions  are  not  happy,  we  will  make  the  much  less  probable 
assumption  (actually  much  farther  from  the  truth)  that  the  helio- 
centric distances  are  an  arithmetical  mean  between  the  distances  of 
Mars  and  Jupiter.  This  gives  '526  for  the  logarithm  of  each  of  the 
distances  rl9  r2,  r8.  From  these  assumed  values  we  compute  the  first 
column  of  numbers  in  the  three  following  tables : 


Digitized  by 


Google 


140 


DETERMINATION  OF  ELLIPTIC  ORBITS. 


g!=/>,--3874081  a,=  -[8-670Ol67](g1-9-5901555)(l+.R1) 
*•!*=?,*+  8645886  &=  [9-6883924](g,+  •O90O652)(l+fi1) 
ii1=[9-6692118]r,-»  y,«-[7-9242047](g1+  -8874081X1+^X1 
a'=  - -046775 -[8-67002]£I-P'al 
#-  •482888+[9-68889]iJ1 
y  -  -  -008899  -  [7-92420]^ 


in. 


[•477l2Rgl 


Aft 

-•66781 

-•04668 

-  -0010434 

+  •0000006 

ft 

+ 

3-22606 

2-66876 

2*61317 

2-6142134 

2*6142140 

logrj 

+ 

•62600 

•484860 

•4280791 

•4282876 

•4282377 

kg*, 

+ 

8-00121 

8*864381 

8*8849740 

8*3844986 

logd  +  1*,) 

+ 

•00688 

•000284 

•0104122 

•0104010 

logi* 

+ 

8-01967 

8*809626 

8-8907468 

8*8961467 

«i 

+ 

•80186 

•886606 

•3390606 

•8390018 

A 

+ 

1-61288 

1*807804 

1*286223 

1-2867066 

7i 

- 

-08072 

•025816 

•0249618 

•0249601 

a' 

- 

•060606 

•0668488 

/* 

+ 

•47180 

•4620942 

y 

— 

•00818 

•0079821 

9,-A+-9314223        0,=  +  [9-2282788]  (g,+ 17286820X1  -RJ\ 
♦V-^+'lOOeeSl      &  =  -[9-9900800](g1-  -0861809X1— Refill, 
B,=  [0-3188722]rf-»   y,=  -[91026549](ft-  -9314228X1 -.bJ 

o"=      169161  -  [9-22827]iil+P"a1  \ 
^,^•47712]%,       ^,,=  _^77417+[9^9008]iJf +F'/8f  [  HI" 
(  -Kjrt  y"=_i26664+[910266]iil+P"yJ 


Aft 

-•77826 

+  •006042 

+  •0013222 

+  •0000021 

<Z« 

+ 

3*84286 

2*66409 

2*669132 

2*704542 

2-6704568 

U)grt 

+ 

•62600 

•412233 

•4130738 

•4132984 

•4132937 

log  A, 

-r 

8-74087 

9*081673 

9*0791624 

9*0784920 

log(l-iy 

+ 

9*97643 

9*944142 

9*9444806 

9*9445766 

logP" 

- 

8*71411 

9199120 

9-1964270 

91944696 

«t 

+ 

•81069 

•638489 

•6397466 

•6400760 

ft 

- 

3*06879 

2172660 

2*1787280 

21803116 

y% 

~ 

•28868 

181843 

•1826486 

1827838 

a" 

-- 

•20182 

*2491864 

0" 

- 

1*08177 

1*2018221 

v 

- 

13464 

1400944 

qs=pt  - -5599304 
r$*=g,f+ -7130624 
^,=[9-5623916]^ 


-» 


a,=  -  [9-3810737](g$+ 1-5798163)(1 +Rj\ 
&=     [9-6537308](g,-  -4630521)(l+.ftMni, 
y,=     [8-8361256](g,+  -6599304)(l+jyJ 

>  -•240477-[9-38107]fl,-P"'o,] 


P477121  JLa       °'"  "  -240477-[9-38107]fl,-P"'«,  ) 
P"'~  (TTWY*      &"**  +  *50537+[9-65373]J8,-F"i8,l III" 
K        *;r*        y'"=  +-068569+t8-83613]J8,-P"'yJ 
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Afc 

-*80780 

- -04065 

+  •0025316 

+  •0000081 

9s 

+ 

3-24945 

2-44165 

2-40110 

2-4096316 

2-4036347 

logr, 

+ 

0*62600 

•412217 

•4057319 

•4061394 

•4061399 

log/?, 

+ 

7*08439 

8*325742 

83451948 

8-3439733 

log(l  +  J^ 

+ 

•00417 

•009099 

•0095106 

•0094843 

logP"' 

+ 

7-91715 

8-357016 

8-3817516 

8  3801993 

«• 

- 

117253 

•987590 

•9785152 

•9790776 

fit 

+ 

1*26749 

•910805 

•8924966 

•8936069 

7| 

+ 

•26373 

•210171 

•2075292 

•2076940 

a'" 

- 

•22847 

•2222335 

ft" 

+ 

•44441 

•4390163 

V" 

+ 

•06690 

•0650888 

IV. 

The  values  of  a,  ft,  etc.,  furnish  the  basis  for  the  computation  of 
the  following  quantities : 

a^ -01254  o,= -03517  Oj- -07232 
6j  -  +  -01726  6,  =  -  -00525  6S  =  -  -00845 
Cj= -15746         c,=  -08526         Cg=-O4050 

For  O  we  get  three  values  sensibly  identical    Adopting  the  mean, 
we  set 

(7-=  -01006. 

We  also  get  fl»--00998,        Z-02322.* 


Taking  the  values  of  av  a,,  etc.,  from  the  columns  under  III,,  IIIt> 
III,,  we  form  the  residuals 

a= -06058,        £-  -16692,        y= -05557. 
From  these,  with  the  numbers  last  computed,  we  get 

0;= --65888,        C,-=  --76983,        Cs=  -79939, 

which  might  be  used  as  corrections  for  our  values  of  gv  qt,  q9.    To 
get  more  accurate  values  for  these  corrections  we  set 

Agt-Ct-A^Afofc    or    Aft»--76983-01393(A^)1, 
which  gives  Aqt  =  -  77826. 

The  quadratic  term  diminishes  the  value  of  Aft  by  -00843.  Sub- 
tracting the  same  quantity  from  Gx  and  C,  we  get 

A&=  --66731,        Afc=  -80780. 

*  It  would  have  been  better  to  omit  altogether  the  calculation  of  H  and  L,  if  the 
small  value  of  the  latter  oould  have  been  foreseen.  In  fact,  it  will  be  found  that 
the  terms  containing  L  hardly  improve  the  oonvergenoe,  being  smaller  than  quantities 
which  have  been  neglected.  Nevertheless,  the  use  of  these  terms  in  this  example  will 
illustrate  a  process  which  in  other  oases  may  be  beneficial. 
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VI. 

Applying  these  corrections  to  the  values  of  ql9  q2,  qs  we  compute 
the  second  numerical  columns  under  equations  III1>  III^,  and  III8. 
We  do  not  go  on  to  the  computations  by  equations  III',  etc,  but 
content  ourselves  with  the  old  values  of  04,  bv  etc.,  (?,  and  Z,  which 
with  the  new  residuals 

a  =  -  012595,        0  =  044949,        y  =  -003012, 
give  C^-  -04567,         C2= -004952,       C8=  -  -04064. 

A^=C2-Z(i)g2+iC72)Ag2=004952-.-02322(---778264--00247)Agr 

This  gives  Aqt  =  005042. 

As  the  term  containing  L  has  increased  the  value  of  Aqt  by  00009, 
we  add  this  quantity  to  Ox  and  C8,  and  get 

Aqx  =  -  -04558,        Aft  =  -  04055. 

With  these  corrections  we  compute  the  third  numerical  columns 
under  equations  III1,  etc.  This  time  we  recompute  the  quantities  a', 
etc.,  with  which  we  repeat  the  principal  computations  of  IV,  and  get 
the  new  values 

»!=  -0167215         a2=  -0335815         0,=  - -0743299 
bx  =  +  -0149145         62  =  -  0054413         bs  =  -  -0098825 
^=-•1576886         c2=--0779570         c8=- 0474318 
<?= -0090929. 

The  quantities  H  and  L  we  neglect  as  of  no  consequence  at  this 
stage  of  the  approximation. 

With  these  values  the  new  residuals 

a  =  +  -0002919,        p  =  -  0000044,        y  »  +  -0000288, 

give  Aqx = Gx  =  +  -0010434,        Aq2  =  C2  =  +  -0013222, 

Ag8=08=+ -0025316. 

These  corrections  furnish  the  basis  for  the  fourth  columns  of 
numbers  under  equations  III1,  etc.,  which  give  the  residuals 

a  =  +  0000002,        p  =  +  0000009,  y  =  +  '0000001, 

and  the  new  corrections 

Aqx=+  -0000006,        Aft  = +'0000021,  Afc=+  -0000031. 

The  corrected  values  of  qlt  q2,  qz  give 

log ria=  0-4282377,    log  r2= 04132937,  log  r8= 04061399. 

We  have  carried  the  approximation  farther  than  is  necessary  for 
the  following  correction  of  the  forrn/ida,  in  order  to  see  exactly  where 
the  uncorrected  formula  would  lead  us,  and  for  the  control  afforded 
by  the  fourth  residuals. 
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The  computations  for  the  test  of  the  uncorrected  formula  (the  first 
hypothesis)  are  as  follows : 


Number  or  mrc. 

Logarithm. 

»*1 

0-4282377 

*s 

0*4132937 

*t 

0-4061399 

AiRft- 

+ 

•01174865 

8-0699879 

B**~* 

+ 

•11980944 

9*0784911 

AtBrr9 

+ 

•01187670 

80560162 

N 

+ 

•14293479 

91551380 

•h 

+ 

1-3808476 

0*1241283 

H 

+ 

2-2796616 

0-3578704 

H 

+ 

1-3417404 

0*1276685 

$ 

+ 

2-4761248 

0-3937725 

«-«i 

+ 

1-1453772 

0*0589106 

•  -«, 

+ 

01964632 

9*2932812 

#-a, 

+ 

11343844 

0-0547602 

B 

+ 

9-5065898 

P 

+ 

0-4391732 

tanK^-Vj) 

+ 

15°  48'  KT-82 

9-4518296 

tanMt-,-1*,) 

+ 

15°  39*  36**38 

9*4476792 

tanlto-*!) 

+ 

81°  27'  4r-20 

9-7866915 

eiini(t^+V|) 

- 

8*7099387 

ewmlivi+Vi) 

+ 

8*7872701' 

tanj^+i-j) 

- 

-39°  55'  82**31 

9*9226686 

e 

+ 

8*9025438 

e 

+ 

9*9652259 

a 

+ 

0*4419546 

tanfo 

- 

-35°  41'  39**75 

9-8568809 

tan  fa 

- 

-19°  53'28**93 

9-5584981 

tan  Jr, 

- 

-  4°  13'  52**55 

8-8691380 

tan  J^ 

- 

-33*  33'    <T17 

9*8216068 

tan  42?, 

- 

- 18°  28'    6*-35 

9*5287240 

taniff, 

- 

-  3°  54'  24**21 

8-8343639 

sin  j^i 

- 

-67°    6'    (f-34 

9*9643473 

sin  jE, 

- 

-36°  56'  12**70 

9*7788272 

-ain#, 

- 

-  7°  48*  48*42 

9*1383734 

eai  sin  ^ 

- 

•3387061 

9-5298230 

sal  sin  JE, 

- 

-2209545 

9*3448029 

sol  sin  Et 

- 

-0499861 

8*6988491 

a*(B%-Ex) 

+ 

2-4226307 

0-3842872 

<A{Et-EJ 

+ 

2-3391145 

0-3690515 

T$mlo. 

+ 

23048791 

0*3626482 

Tl«*la 

+ 

21681461 

0*3360885 
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The  logarithms  of  the  calculated  values  of  the  intervals  of  time 
exceed  those  of  the  given  values  by  '0002416  for  the  first  interval  (t8) 
and  0002365  for  the  second  (rx).  Therefore,  since  the  corrections  for 
aberration  have  been  incorporated  in  the  data,  we  set  for  the  correction 
of  the  formula  (for  the  second  hypothesis) 

AlogTx=  -0002365     Alogrs=  -O002416 
This  gives    A  log  4  x=  0000026        A  log  A%  --  -0000025 
A  log  ^=-0004872      A  log  ^=-0004782      A  log  Bz  =--0004665 
The  new  values  of  the  logarithms  of  Av  As  are 

log  AY  =  9-6854923         log  Az  -  9*7120418 

Applying  these  corrections  to  equations  U\v  III,,  III,  *  we  get  the 
following : 
r1*-g1*+-8645886 


A,. 


o^  -[8-6700193](g1-9-5901555)(l  + fix) 
5  [9-6687241]^-* 

ft  -  +  [9-6833950]  (qt  +  -0900552)(1  + jty 
y1=-[7-9242073](<71+  -3874081X1+^) 


IIIj  corrected. 


*Vi 

+  -0002887 

-   -0000217 

9i 

+ 

2*5142140 

2-5145027 

2-5144810 

kg*-i 

+ 

•4282377 

•4282816 

4282782 

log  A 

+ 

8*3838110 

8-3838788 

8-3838894 

logd  +  J?,) 

+ 

•0103847 

•0103863 

•0103865 

«i 

+ 

3389010 

•3380784 

•3389796 

ft 

+ 

1-2866654 

1-2868124 

1-2868024 

ti 

- 

•0240508 

•0240619 

•0249617 

l°g«i 

+ 

1241571 

r«*=«t'+1006681 

<4»  +[9*2282738]  (gt+ 17286820X1  -  J^ 

.R1-[0-3178940]v» 

&=-[9-9900800](gt-  -0361309X1 -•««) 
y,=  -  [91026549]  (fc-  -9814223X1 -i^ 


III,  corrected 


Aft 

-   -0000955 

f  -0000187 

9s 

+ 

2-5704563 

2-5703608 

2-5708795 

logra 

+ 

0-4132987 

•4132778 

•4132809 

log*. 

+ 

9*0780129 

9-0780605 

9-0780613 

log(l-iW 

+ 

9-9446418 

9-9446353 

9-9446365 

«4 

+ 

•6401725 

•6401487 

•6401532 

ft 

- 

21806412 

2-1805261 

21805482 

7a 

- 

1827615 

1827481 

1827507 

log* 

+ 

•3579174 

*  The  corrections  may  be  made  without  rewriting  the  equations. 
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V=?,*+-7130624 

a,=  -[9-8810712](g,+  l-5798163)(l  +  jy 

JR,-[9-5619261]r,-» 

&=+[9-6537283](g,-  -4680521X1 +RJ 
y,= +[8-8861231](g,+  •5599304)(1+.R1) 


■III,  corrected. 


&qt 

4-  -0003302 

+   -0000424 

1* 

+ 

2-409*847 

24039649 

2-4040073 

l°g*i 

+ 

•4061399 

•4061929 

'4061998 

log/?, 

+ 

8*3435066 

8-3433463 

8-3433207 

logd  +  i?,) 

+ 

•0094742 

•0094708 

•0094704 

«• 

- 

•9790600 

•9791236 

•9791329 

ft 

+ 

•8936824 

•8937277 

•8937461 

7i 

+ 

•2076882 

•2077097 

•2077124 

log* 

+ 

1277120 

With  these  corrected  equations  the  last  values  of  ql9  qt>  qt  give 
e  residuals 

£=-..0003934  y-- -0000826 


the  residuals 

a  =  -0001135 
These  give  the  corrections 

Agt  =  -0002887        Afc  =  -  -0000955 
The  next  residuals  are 

a  =0000035  £=-0000140 

which  give  the  corrections 

Aft=-O0002l7    Aft=-0000187 
The  next  residuals  are 

a=-O000001        £=-0000003 
which  must  be  regarded  as  entirely  insensible. 


Afc= -0003802 
y=--0000003 

Afc= -0000424 
y=0000000 


IX,  X. 

It  remains  to  determine  the  ellipse  which  passes  through  the  points 
to  which  the  numbers  relate  in  the  last  columns  under  the  corrected 
equations  IIIlf  III,,  III8,  and  also  the  time  of  perihelion  passage. 
The  computations  are  as  follows : 
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XL 
This  gives  the  following  equations  for  an  ephemeris : 
T=1806,  June,  23*97450,  Paris  mean  time 
[2-8863186](*  -  T) •= J£to  wmmA  -  [4-2216270]  sin  E 
Heliocentrio  coordinates  relating  to  the  eoliptio. 
x  =  + 1820700  -  [0-3530366]  cos  tf-  [0*1827457]  sin  E 
y  «  -  1244685 +[01878576]  cos  J?- [03603257]  sin  E 
z=  --0373970+[9-6656346] cos  J£+[93320292]sin E 
The  differences  of  the  values  of  T{1>  T(t>  T{^  from  their  mean  T, 
indicate  the  residual  errors  of  this  hypothesis.    They  indicate  differ- 
ences in  the  calculated  and  the  observed  geocentric  positions  which 
are  represented  by  the  geocentric  angles  subtended  by  the   path 
described  by  the  planet  in  the  following  fractions  of  a  day :  *00054, 
•00003,  -00052.     Since  the  heliocentric  motion  of  the  planet  is  about 
one-fourtn  of  a  degree  per  day,  and  the  planet  is  considerably  farther 
from  the  earth  than  from  the  sun  at  the  times  of  the  first  and  third 
observations,  the  errors  will  be  less  than  half  a  second  in  arc 

If  we  desire  all  the  accuracy  possible  with  seven-figure  logarithms, 
we  may  form  a  third  hypothesis  based  on  the  following  corrections : 

AlogTl=4f%^>=  - -0000017, 

AlogT8=3f%^^=  -O000018. 

The  equations  for  an  ephemeris  will  then  be : 

T=1806,  June  23*96378,  Paris  mean  time 

[2-8863140](<-  T)= JE'toMeond,-[4-2216530]  sin  E 

Heliooentrio  coordinates  relating  to  the  eoliptic. 

x  =  +  -1820765  -  [0-3530261]  cos  E-  [0-1827783]  sin  E 

y  m.  -  1244853 + [0*1878904]  cos  E  -  [0*3603153]  sin  E 

z=-  -0373987 + [96656285]  cos  E+  [9*3320758]  sin  E 

The  agreement  of  the  calculated  geocentric  positions  with  the  data 

is  shown  in  the  following  table : 


Times,  1806,  September 

5*51336 

139*42711 

265*30813 

Second  hypothesis : 
longitudes     • 
errors   - 
latitudes 
errors    - 

95*  32*  18*-88 
0*32 

-<T59*34*-01 
0**06 

99*49'   5*-87 
(f-00 

r  16'  3e*-82 

(f-02 

118*   5' 28*52 

-<r-33 

7°  38'  49*34 
-(T-05 

Third  hypothesis : 
longitudes     • 
errors   • 
latitudes 
errors    - 

95°32'18*-65 

•roe 

-<r5e*34"-04 

(r-02 

99*49'   5*82 

-<r«05 

r  16'  36**78 

-<r-02 

118°   5' 28*79 

-<T06 

r  38*  49*38 

-(TOl 
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The  immediate  result  of  each  hypothesis  is  to  give  three  positions  of 
the  planet,  from  which,  with  the  times,  the  orbit  may  be  calculated  in 
various  ways,  and  with  different  results,  so  far  as  the  positions  deviate 
from  the  truth  on  account  of  the  approximate  nature  of  the  hypothesis. 
In  some  respects,  therefore,  the  correctness  of  an  hypothesis  is  best 
shown  by  the  values  of  the  geocentric  or  heliocentric  distances  which 
are  derived  directly  from  it  The  logarithms  of  the  heliocentric 
distances  are  brought  together  in  the  following  table,  and  correspond- 
ing values  from  Gauss*  and  Oppolzert  are  added  for  comparison.  It  is 
worthy  of  notice  that  the  positions  given  by  our  second  hypothesis  are 
substantially  correct,  and  if  the  orbit  had  been  calculated  from  the 
first  and  third  of  these  positions  with  the  interval  of  time,  it  would 
have  left  little  to  be  desired. 


logr,. 

logrt. 

logr,. 

First  hypothesis 
Second  hypothesis   • 
Third  hypothesis     • 

•4282377 
•4282782 
4282786 

•4132987 
•4132809 
•4132808 

4061399 
•4061996 
•4062003 

Gauss: 

First  hypothesis 
Seoond  hypothesis     • 
Third  hypothesis 
Fourth  hypothesis    - 

'4328934 
•4291773 
•4284841 
•4282792 

•4114726 
•4129871 
•4132107 
•4132817 

•4094712 
-4071975 
•4064697 
•4062033 

Oppolcer: 

First  hypothesis 
8eoond  hypothesis    - 
Third  hypothesis 

•4281340 
•4282794 
•4282787 

•413380 
•4132801 

•4061699 
•4061976 
•4062009 

In  comparing  the  different  methods,  it  should  be  observed  that 
the  determination  of  the  positions  in  any  hypothesis  by  Gauss's 
method  requires  successive  corrections  of  a  single  independent  variable, 
a  corresponding  determination  by  Oppolzer's  method  requires  the 
successive  corrections  of  two  independent  variables,  while  the  corre- 
sponding determination  by  the  method  of  the  present  paper  requires 
the  successive  corrections  of  three  independent  variables. 


*  Theoria  motus,  $  159. 

t  Lehrbuch  xmr  Bahnbestimmung  dtr  Konuten  %md  Planeten,  2nd  ed.,  voL  i,  p.  394. 
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ON  THE  USE  OF  THE  VECTOR  METHOD  IN  THE  / 

DETERMINATION  OF  ORBITS 

Lsttsb  to  Dr.  Hugo  Buchholz,  Editor  of  Klinkerfues'  Theoretische 

Astroncmie.** 

New  Haven,  October,  1898. 
Dr.  Hugo  Buchholz, 

My  dear  Sir, — The  opinion  of  Fabritius  t  on  the  comparative  con- 
venience of  different  methods  is  entitled  to  far  more  weight  than 
mine,  for  I  am  no  astronomer,  and  have  calculated  very  few  orbits, 
none,  indeed,  except  for  the  trial  of  my  own  formulas.  The  object  of 
my  paper  was  to  show  to  astronomers,  who  are  rather  conservative 
(and  with  right,  for  astronomy  is  the  oldest  of  the  exact  sciences), 
the  advantage  in  the  use  of  vector  notations,  which  I  had  learned  in 
Physics  from  Maxwell  This  object  could  be  best  obtained,  not  by 
showing,  as  I  might  have  done,  that  much  in  the  classic  methods 
could  be  conveniently  and  perspicuously  represented  by  vector 
notations,  but  rather  by  showing  that  these  notations  so  simplify 
the  subject,  that  it  is  easy  to  construct  a  method  for  the  complete 
solution  of  the  problem.  That  the  method  given  is  the  best  possible, 
I  certainly  do  not  claim,  but  only  that  it  is  much  better  than  I  could 
have  found  without  the  use  of  vector  notations.  Some  of  the  more 
obvious  crudities  in  my  paper  have  been  corrected  in  that  of  Beebe  and 
Phillips.  X  Doubtless  many  more  remain,  even  if  the  general  method 
be  preserved. 

My  first  efforts,  however,  to  solve  the  fundamental  approximative 
equation  were  along  the  same  lines  which  Fabritius  has  followed: — 
to  set  rx=r2  and  rs=rs  in  equation  second  of  (2)  of  Fabritius,  which 
will  give  />s  and  r2,  then  to  get  rx  from  the  first  of  (3)  of  Fabritius, 
and  then  r8  either  from  equation  second  of  (3)  or  from  some  other 

*  [In  which  the  preceding  memoir,  and  also  that  of  Beebe  and  Phillips  referred  to 
below,  are  translated.] 

t[See  Fabritius,  W.  "Ueber  eine  leiohte  Method©  der  Bahnbestimmong  mit 
Zugrundelegung  des  Principe  von  Gibbe,"  also  "  Weitere  Anwendungen  dea  Gibbs'ohen 
Prinoips,"  Aetronomitche  Nachrichten,  No.  3061,  3065  (1891)]. 

♦  ["  The  Orbit  of  Swift's  comet,  1880  V,  Determined  by  Gibbs's  Vector  Method," 
W.  Beebe  and  A.  W.  Phillips.     QoulcT$  Astronomical  Journal,  vol.  ix,  Dec.  1889.] 
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which  would  serve  the  purpose,  and  then  to  find  better  values  of 
Pv  rt  ty  setting  in  equation  second  of  (2) 

the  expressions  in  brackets  denoting  numbers  derived  from  the 
approximate  values  already  found.  This  is  similar  to  or  identical  with 
the  method  of  Fabritius,  except  that  he  combines  with  it  the  principle 
of  interpolation  (for  the  first  value  in  the  third  "hypothesis").  As 
I  found  the  approximation  by  this  method  sometimes  slow  or  failing, 
notably  in  the  case  of  Swift's  comet,  1880  V,  I  tried  the  method 
published  in  my  paper.  Indeed,  it  may  be  said  that  the  method  of 
my  paper  was  constructed  to  meet  the  exigencies  of  the  case  of  the 
comet  1880  V. 

In  ordinary  cases  I  think  that  the  method  of  Fabritius  may  very 
likely  be  better  than  that  which  I  published.  The  equations  are  very 
simply  and  perspicuously  represented  in_  vector  notations.  I  shall  use 
the  notations  of  my  paper,  writing  E,  F, etc.,  for  German  letters*  To 
eliminate  fa  and  ps  from  equation  (7)  in  my  paper,  multiply  directly 
by  8j  X  5r    This  gives 

+^(l+§)«fcai)-a  (a) 

To  eliminate  p,  and  r,,  multiply  by  <E,x5,  which  gives 


.(i+^)[(«i<W+ft(&,W] 


When  we  have  found  a>  ri>  ft»  rt  ^  **  no^  necessary  to  eliminate 
any  of  them,  and  to  save  labor  in  forming  the  equation  for  p^,  fs,  I 
should  be  inclined  to  take  the  components  in  (7)  in  the  direction  of 
one  of  the  coordinate  axes,  choosing  that  one  which  is  most  nearly 
directed  towards  the  third  observed  position.     However,  I  will  write 

+^(n.^)[(^.9)+/0,(8,.9)]-o,  (c) 

where  $  may  represent  an  axis  of  coordinates,  or  (fiiXSi)  which 
would  give  Fabritius'  equation.    It  might  be  directed  towards  the  pole 

*[In  the  remainder  of  the  letter  m  here  printed  German  oapitala  have  been  anbstitnted 
for  the  E,  7,  P,  ©to.  of  the  original,  thus  making  the  notation  uniform  with  that  of 
the  paper  referred  to.] 
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of  the  ecliptic,  which  would  make  ((5^),  («,.?),  (<8s¥)  vanish, 
except  for  exceedingly  minute  quantities  depending  on  the  latitude  of 
the  sun  and  the  geocentric  coordinates  of  the  observatories,  if  these 
are  included  in  8^,  (S,,  <£,. 

The  equations  (a),  (6),  (c),  which  are  together  equivalent  to  (7),  I 
would  solve  as  follows,  almost  in  the  same  way  as  Fabritius,  but 
relying  a  little  more  on  interpolation,  and  less  on  the  convergence  of 
which  he  speaks,  which  in  special  cases  may  more  or  less  fail. 

Setting  rx=rt  and  r8=r2  in  (a),  which  thus  modified  I  shall  call  (a7), 
and  solving  this  (a')  by  "  trial  and  error,"  using  p,  as  the  independent 
variable,  as  soon  as  I  have  a  value  of  />,  which  I  think  will  give  a 
residual  of  (af)  of  the  same  order  of  smallness  as  the  effect  of  changing 

— I  and  —5  into  — j,  I  determine  from  this  value  by  (b)  and  (c),  rx  and 
ri  r$  rt 

r8,  and  then  find  the  residual  of  (a),  using  the  values  of  rv  r2,  r8 
derived  all  from  the  same  assumed  p,.     Now  using  the  last  value  of 

— ^-r -  in  my  previous  calculations  on  (a')  which  indeed  applies 

only  roughly  to  the  (a),  I  would  get  a  value  p,  which  I  would  use  for 
the  second  "  hypothesis "  in  (a).  This  will  give  a  second  residual  in 
(a),  which  will  enable  me  to  make  a  more  satisfactory  interpolation. 
As  many  more  interpolations  may  be  made  as  shall  be  found  necessary. 

Some  such  method,  which  should  perhaps  be  called  the  method  of 
Fabritius,  would,  I  think,  in  most  cases  probably  be  the  best  for 
solution  of  equation  (7). 

Of  course  I  am  quite  aware  that  the  merit  of  my  paper,  if  any,  lies 
principally  in  the  fundamental  approximation  (1).  I  will  add  a  few 
words  on  this  subject 

The  equation  may  be  written  more  symmetrically 

£1+aaa^_(l+aUS£M)* 

It  might  be  made  entirely  symmetrical  by  writing  —  t2  for  «rf. 

If  an  expression  ending  with  t*  had  been  used,  we  could  still  have 
satisfied  two  of  the  conditions  relating  to  acceleration,  and  should 
have  obtained 

?,(1+^>-^+^(1+z^>-0>        m 
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Using  an  expansion  ending  with  t*  we  can  only  satisfy  one  con- 
dition relating  to  acceleration,  say  the  second.    This  will  give 

(Gauss  uses  virtually 

T»         14.  til*    T« 
1+2r2» 

which  is  a  little  more  convenient,  but  not,  I  think,  generally  quite 
so  accurate.) 

Writing  an  equation  analogous  to  III  for  the  earth  and  subtracting 
from  (7),  Mem.  Nat.  Acad.,  we  have 

which  gives,  on  multiplication  by  01xf$s  and  SjXfiE,,  theorems  of 
Olbers  and  Lambert 

It  is  evident  that  in  general  the  error  in  I  is  of  the  fifth  order,  in 
Ha,  116,  lie  of  the  fourth,  and  in  HI  of  the  third  But  for  equal 
intervals,  the  error  in  I  is  of  the  sixth  order,  and  in  EQ  of  the  fourth. 
And  when  tj2H-ts2— 3T1i=«0,  Ha  becomes  identical  with  I,  and  its 
error  is  of  the  fifth  order. 

The  same  is  true  of  lie  in  the  corresponding  case.  It  follows  that 
when  the  intervals  are  nearly  as  5 : 8  we  should  use  Ua  or  lib  instead 
of  L  This  will  evidently  abbreviate  the  solution  given  above  as 
only  one  of  the  quantities  rv  r9  is  to  be  used. 

The  formulas  Ha,  116,  lie  may  also  be  obtained  by  the  following 
method,  which  will  show  their  relative  accuracy. 

The  interpolation  formula 

has  an  error  evidently  of  the  second  order.  If  we  multiply  by 
^ — ^£r~ — L  and  subtract  from  I,  we  get  Ha.    So  if  we  multiply  by 

24  OT  24 

we  get  116  or  He.  The  errors  due  to  using  one  of  these  equations 
instead  of  I  are  therefore  proportional  to  these  multipliers  and  very 
unequal 

Again,  in  case  of  equal  intervals,  Ha  and  lie  become  identical  with 
HI.  There  is,  therefore,  no  reason  for  using  Ua  or  He  when  the 
intervals  are  nearly  equal.  n6  is  in  this  case  much  less  accurate 
than  III. 
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It  will  be  observed  that  all  the  formulae  I,  Ila,  116,  He,  III,  may  be 
expressed  in  the  general  form 


(i+§Mi+$)*+a(i+§)*-a 


except  that  the  letters  BltB%y  Bt  have  different  values  in  the  different 
cases,  some  vanishing  in  the  more  simple  formulas.  Moreover,  if  the 
values  of  Bv  Bt,  Bt  have  been  calculated  for  I,  the  values  for  Ila,  116, 
or  lie  are  found  simply  by  subtraction  of  one  of  the  numbers  from  the 
three.  It  is  evident  that  116  will  hardly  be  useful  except  in  special 
cases,  as  in  the  determination  of  a  parabolic  orbit  in  the  failing  case  of 
Olbers'  method,  and  then  it  would  be  a  question  whether  it  would  not 
be  better  to  determine  the  orbit  from  p,  and  pp  or  p,  and  /^,  using  Ila 
or  He. 

Equations  Ila  and  lie  are  very  appropriate  for  the  determination 
of  an  elliptic  orbit  when  the  observed  motion  is  nearly  in  the  ecliptic, 
by  means  of  four  observations  with  intervals  nearly  in  the  ratio 
5:8:5. 

It  is  evident  that  the  solution  of  (7)  given  above  may  be  varied,  in 
ways  too  numerous  to  mention,  by  the  use  of  the  simpler  forms  Ila, 
lie,  or  EH  for  I  in  the  earlier  stages  of  the  work.  This  only  involves 
changing  the  values  of  Bv  Bv  Btt  in  (a),  (6)  and  (o). 

It  is  not  correct  to  say  that  in  my  expressions  for  the  ratios  of  the 
triangles  the  error  is  of  the  fifth  order  in  general,  or  for  equal  intervals, 
of  the  sixth.  If  we  write  px>  p%i  pt,  for  the  coefficients  of  9^,  91,,  9^ 
in  I,  and  X  for  the  error  of  the  equation,  we  have  exactly 

which  gives  Px^xSRj-pjSRjXSls =1x9*3, 

Now  £1  is  my  expression  for  the  ratio  of  the  triangles,  and     $»*  jL 

is  its  error.  This  is  of  the  fourth  order  in  general  (since  the  denomin- 
ator is  of  the  first),  and  for  equal  intervals,  of  the  fifth.  The  same  is 
true  of  the  two  other  ratios.    Thus  we  have 

»iX%t=P»       ^x£ 
*iX«8    pt    ft«iX9ts' 

Adding  these  equations  and  subtracting  1  [from  both  sides]  we  have 

8t1xat2H-at2x8ta-»t1xat>=j91-hyg-p1  (j^-ayxg 

«iX«,  pt  PfcHiXH,   " 

Here  the  last  term,  which  represents  the  error,  is  of  the  fifth 
order  in  general,  or  for  equal  intervals,  of   the   sixth.     But  the 
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quantity  sought  is  of  the  second  order,  and  the  relative  error  is  of 
the  third  order  in  the  general  case,  or  the  fourth  for  equal  intervals. 
It  is  precisely  this  error  which  is  most  important  in  the  case  of 
elliptic  orbits. 

It  will  be  observed  that  the  accuracy  of  the  expressions  for  the 
ratios  far J :  far^\ :  for,]  affords  no  measure  of  the  accuracy  of  the 
formula  for  the  determination  of  elliptic  orbits. 

I  think  that  this  hasty  sketch  will  illustrate  the  convenience  and 
perspicuity  of  vector  notations  in  this  subject,  quite  independently  of 
any  particular  method  which  is  chosen  for  the  determination  of  the 
orbit  What  is  the  best  method  ?  is  hardly,  I  think,  a  question  which 
admits  of  a  definite  reply.  It  certainly  depends  upon  the  ratio  of  the 
time  intervals,  their  absolute  value,  and  many  other  things. 

Yours  very  truly, 

J.  WlLLARD  GIBB8. 

PJ8. — If  we  wish  to  use  the  cwrtote  distances,  with  reference  to  the 
ecliptic  or  the  equator,  let  fa  be  defined  as  the  distance  multiplied  by 
cosine  (lat.  or  dec),  and  ^  as  a  vector  of  length  secant  (lat  or  dec). 
For  the  most  part  the  formulae  will  require  no  change,  but  the  square 
of  5i  will  be  sec*(lat.  or  dec)  instead  of  unity,  so  that  the  last  terms 
of  (8)  will  have  this  factor.  (&$,$,)  will  then  be  Gauss'  (0.1.2.), 
whereas  in  my  paper  (5i&5$)  is  Lagrange's  (C'C'C""). 

j.  w.  a. 
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VII. 

ON  THE  R6LE  OF  QUATERNIONS  IN  THE. ALGEBRA 
OF  VECTORS. 

[Nature,  vol.  xuil.  pp.  511-513,  April  2,  1891.] 

The  following  passage,  which  has  recently  come  to  my  notice,  in 
the  preface  to  the  third  edition  of  Prof.  Tait's  Quaternions  seems 
to  call  for  some  reply : 

"  Even  Prof.  Willard  Gibbs  must  be  ranked  as  one  of  the  retarders 
of  quaternion  progress,  in  virtue  of  his  pamphlet  on  Vector  Analysis, 
a  sort  of  hermaphrodite  monster,  compounded  of  the  notations  of 
Hamilton  and  of  Grassinann." 

The  merits  or  demerits  of  a  pamphlet  printed  for  private  distri- 
bution a  good  many  years  ago  do  not  constitute  a  subject  of  any  great 
importance,  but  the  assumptions  implied  in  the  sentence  quoted  are 
suggestive  of  certain  reflections  and  inquiries  which  are  of  broader 
interest,  and  seem  not  untimely  at  a  period  when  the  methods  and 
results  of  the  various  forms  of  multiple  algebra  are  attracting  so  much 
attention.  It  seems  to  be  assumed  that  a  departure  from  quaternionic 
usage  in  the  treatment  of  vectors  is  an  enormity.  If  this  assumption 
is  true,  it  is  an  important  truth ;  if  not,  it  would  be  unfortunate  if  it 
should  remain  unchallenged,  especially  when  supported  by  so  high  an 
authority.  The  criticism  relates  particularly  to  notations,  but  I 
believe  that  thore  is  a  deeper  question  of  notions  underlying  that  of 
notations.  Indeed,  if  my  offence  had  been  solely  in  the  matter  of 
notation,  it  would  have  been  less  accurate  to  describe  my  production 
as  a  monstrosity,  than  to  characterize  its  dress  as  uncouth. 

Now  what  are  the  fundamental  notions  which  are  germane  to  a 
vector  analysis?  (A  vector  analysis  is  of  course  an  algebra  for 
vectors,  or  something  which  shall  be  to  vectors  what  ordinary  algebra 
is  to  ordinary  quantities.)  If  we  pass  over  those  notions  which  are  so 
simple  that  they  go  without  saying,  geometrical  addition  (denoted 
by  +)  is,  perhaps,  first  to  be  mentioned.  Then  comes  the  product  of 
the  lengths  of  two  vectors  and  the  cosine  of  the  angle  which  they 
include.  This,  taken  negatively,  is  denoted  in  quaternions  by  Sa/3, 
where  a  and  /8  are  the  vectors.  Equally  important  is  a  vector 
at  right  angles  to  a  and  /8  (on  a  specified  side  of  their  planeX  ***! 
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representing  in  length  the  product  of  their  lengths  and  the  sine  of 
the  angle  which  they  include.  This  is  denoted  by  Va/8  in  quaternions. 
How  these  notions  are  represented  in  my  pamphlet  is  a  question  of 
very  subordinate  consequence,  which  need  not  be  considered  at 
present.  The  importance  of  these  notions,  and  the  importance  of  a 
suitable  notation  for  them,  is  not,  I  suppose,  a  matter  on  which  there 
is  any  difference  of  opinion.  Another  function  of  a  and  /8,  called 
their  product  and  written  a/S,  is  used  in  quaternions.  In  the  general 
case,  this  is  neither  a  vector,  like  Va/8,  nor  a  scalar  (or  ordinary 
algebraic  quantity),  like  Sa/8,  but  a  quaternion — that  is,  it  is  part 
vector  and  part  scalar.    It  may  be  denned  by  the  equation — 

a/8=Va/8+Sa/8. 

The  question  arises,  whether  the  quaternionic  product  can  claim  a 
prominent  and  fundamental  place  in  a  system  of  vector  analysis.  It 
certainly  does  not  hold  any  such  place  among  the  fundamental 
geometrical  conceptions  as  the  geometrical  sum,  the  scalar  product, 
or  the  vector  product.  The  geometrical  sum  a+/8  represents  the 
third  side  of  a  triangle  as  determined  by  the  sides  a  and  /8.  Va/8 
represents  in  magnitude  the  area  of  the  parallelogram  determined  by 
the  sides  a  and  /8,  and  in  direction  the  normal  to  the  plane  of  the 
parallelogram.  SyVa/8  represents  the  volume  of  the  parallelopiped 
determined  by  the  edges  a,  /8,  and  y.  These  conceptions  are  the  very 
foundations  of  geometry. 

We  may  arrive  at  the  same  conclusion  from  a  somewhat  narrower 
but  very  practical  point  of  view.  It  will  hardly  be  denied  that 
sines  and  cosines  play  the  leading  parts  in  trigonometry.  Now  the 
notations  Va/8  and  Sa/3  represent  the  sine  and  the  cosine  of  the  angle 
included  between  a  and  /8,  combined  in  each  case  with  certain  other 
simple  notions.  But  the  sine  and  cosine  combined  with  these 
auxiliary  notions  are  incomparably  more  amenable  to  analytical 
transformation  than  the  simple  sine  and  cosine  of  trigonometry, 
exactly  as  numerical  quantities  combined  (as  in  algebra)  with  the 
notion  of  positive  or  negative  quality  are  incomparably  more  amenable 
to  analytical  transformation  than  the  simple  numerical  quantities  of 
arithmetic 

I  do  not  know  of  anything  which  can  be  urged  in  favor  of  the 
quaternionic  product  of  two  vectors  as  a  fundamental  notion  in 
vector  analysis,  which  does  not  appear  trivial  or  artificial  in  com- 
parison with .  the  above  considerations.  The  same  is  true  of  the 
quaternionic  quotient,  and  of  the  quaternion  in  general. 

How  much  more  deeply  rooted  in  the  nature  of  things  are  the 
functions  Sa/3  and  Va/8  than  any  which  depend  on  the  definition 
of  a  quaternion,  will  appear  in  a  strong  light  if  we  try  to  extend 
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our  formal®  to  space- of  four  or  more  dimensions.  It  will  not  be 
claimed  that  the  notions  of  quaternions  will  apply  to  such  a  space, 
except  indeed  in  such  a  limited  and  artificial  manner  as  to  rob  them 
of  their  value  as  a  system  of  geometrical  algebra.  But  vectors  exist 
in  such  a  space,  and  there  must  be  a  vector  analysis  for  such  a  space. 
The  notions  of  geometrical  addition  -and  the  scalar  product  are 
evidently  applicable  to  such  a  space.  As  we  cannot  define  the 
direction  of  a  vector  in  space  of  four  or  more  dimensions  by  the  con- 
dition of  perpendicularity  to  two  given  vectors,  the  definition  of  Va/9, 
as  given  above,  will  not  apply  totidem  verbis  to  space  of  four  or  more 
dimensions.  But  a  little  change  in  the  definition,  which  would  make 
no  essential  difference  in  three  dimensions,  would  enable  us  to  apply 
the  idea  at  once  to  space  of  any  number  of  dimensions. 

These  considerations  are  of  a  somewhat  a  priori  nature.  It  may  be 
more  convincing  to  consider  the  use  actually  made  of  the  quaternion 
as  an  instrument  for  the  expression  of  spatial  relations.  The  principal 
use  seems  to  be  the  derivation  of  the  functions  expressed  by  Sa/3  and 
Va/9.  Each  of  these  expressions  is  regarded  by  quaternionic  writers 
as  representing  two  distinct  operations;  first,  the  formation  of  the 
product  a/8,  which  is  the  quaternion,  and  then  the  taking  out  of  this 
quaternion  the  scalar  or  the  vector  part,  as  the  case  may  be,  this 
second  process  being  represented  by  the  selective  symbol,  S  or  V. 
This  is,  I  suppose,  the  natural  development  of  the  subject  in  a  treatise 
on  quaternions,  where  the  chosen  subject  seems  to  require  that  we 
should  commence  with  the  idea  of  a  quaternion,  or  get  there  as  soon 
as  possible,  and  then  develop  everything  from  that  particular  point  of 
view.  In  a  system  of  vector  analysis,  in  which  the  principle  of  de- 
velopment is  not  thus  predetermined,  it  seems  to  me  contrary  to  good 
method  that  the  more  simple  and  elementary  notions  should  be  defined 
by  means  of  those  which  are  less  so. 

The  quaternion  affords  a  convenient  notation  for  rotations.  The 
notation  q(  )q'\  where  q  is  a  quaternion  and  the  operand  is  to  be 
written  in  the  parenthesis,  produces  on  all  possible  vectors  just  such 
changes  as  a  (finite)  rotation  of  a  solid  body.  Rotations  may  also  be 
represented,  in  a  manner  which  seems  to  leave  nothing  to  be  desired, 
by  linear  vector  functions.  Doubtless  each  method  has  advantages  in 
certain  cases,  or  for  certain  purposes.  But  since  nothing  is  more 
simple  than  the  definition  of  a  linear  vector  function,  while  the 
definition  of  a  quaternion  is  far  from  simple,  and  since  in  any  case 
linear  vector  functions  must  be  treated  in  a  system  of  vector  analysis, 
capacity  for  representing  rotations  does  not  seem  to  me  sufficient  to 
entitle  the  quaternion  to  a  place  among  the  fundamental  and  necessary 
notions  of  a  vector  analysis. 

Another  use  of  the  quaternionic  idea  is  associated  with  the  symbol  V. 
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The  quantities  written  SVa>  and  Wa\  where  a>  denotes  a  vector  having 
values  which  vary  in  space,  are  of  fundamental  importance  in  physics. 
In  quaternions  these  are  derived  from  the  quaternion  Va>  by  selecting 
respectively  the  scalar  or  the  vector  part  But  the  most  simple  and 
elementary  definitions  of  SVo>  and  Wa>  are  quite  independent  of  the 
conception  of  a  quaternion,  and  the  quaternion  Vw  is  scarcely  used 
except  in  combination  with  the  symbols  S  and  V,  expressed  or 
implied.  There  are  a  few  formulae  in  which  there  is  a  trifling  gain  in 
compactness  in  the  use  of  the  quaternion,  but  the  gain  is  very  trifling 
so  far  as  I  have  observed,  and  generally,  it  seems  to  me,  at  the  expense 
of  perspicuity. 

These  considerations  are  sufficient,  I  think,  to  show  that  the  position 
of  the  quaternionist  is  not  the  only  one  from  which  the  subject  of 
vector  analysis  may  be  viewed,  and  that  a  method  which  would  be 
monstrous  from  one  point  of  view,  may  be  normal  and  inevitable  from 
another. 

Let  us  now  pass  to  the  subject  of  notations.  I  do  not  know 
wherein  the  notations  of  my  pamphlet  have  any  special  resemblance 
to  Grassmann's,  although  the  point  of  view  from  which  the  pamphlet 
was  written  is  certainly  much  nearer  to  his  than  to  Hamilton's.  But 
this  a  matter  of  minor  consequence.  It  is  more  important  to  ask, 
What  are  the  requisites  of  a  good  notation  for  the  purposes  of  vector 
analysis  ?  There  is  no  difference  of  opinion  about  the  representation 
of  geometrical  addition.  When  wo  come  to  functions  having  an 
analogy  to  multiplication,  the  products  of  the  lengths  of  two  vectors 
and  the  cosine  of  the  angle  which  they  include,  from  any  point  of 
view  except  that  of  the  quaternionist,  seems  more  simple  than  the 
same  quantity  taken  negatively.  Therefore  we  want  a  notation  for 
what  is  expressed  by  —  Safj,  rather  than  &afi,  in  quaternions.  Shall 
the  symbol  denoting  this  function  be  a  letter  or  some  other  sign? 
and  shall  it  precede  the  vectors  or  be  placed  between  them  ?  A  little 
reflection  will  show,  I  think,  that  while  we  must  often  have  recourse 
to  letters  to  supplement  the  number  of  signs  available  for  the  ex- 
pression of  all  kinds  of  operations,  it  is  better  that  the  symbols 
expressing  the  most  fundamental  and  frequently  recurring  operations 
should  not  be  letters,  and  that  a  sign  between  the  vectors,  and,  as  it 
were,  uniting  them,  is  better  than  a  sign  before  them  in  a  case  having 
a  formal  analogy  with  multiplication.  The  case  may  be  compared 
with  that  of  addition,  for  which  a+$  is  evidently  more  convenient 
than  2(a,  £)  or  2a£  would  be.  Similar  considerations  will  apply  to 
the  function  written  in  quaternions  Va$.  It  would  seem  that  we 
obtain  the  me  plus  ultra  of  simplicity  and  convenience,  if  we  express 
the  two  functions  by  uniting  the  vectors  in  each  case  with  a  sign 
suggestive  of  multiplication.    The  particular  forms  of  the  signs  which 
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we  adopt  is  a  matter  of  minor  consequence.  In  order  to  keep  within 
the  resources  of  an  ordinary  printing  office,  I  have  used  a  dot  and  a 
cross,  which  are  already  associated  with  multiplication,  but  are  not 
needed  for  ordinary  multiplication,  which  is  best  denoted  by  the 
simple  juxtaposition  of  the  factors.  I  have  no  especial  predilection 
for  these  particular  signs.  The  use  of  the  dot  is  indeed  liable  to  the 
objection  that  it  interferes  with  its  use  as  a  separatrix,  or  instead  of  a 
parenthesis. 

If,  then,  I  have  written  a./3  and  ax/3  for  what  is  expressed  in 
quaternions  by  —  Sa/8  and  Va/8,  and  in  like  manner  V.»  and  Vx» 
for  —  SV»  and  Wa>  in  quaternions,  it  L%  because  the  natural  develop- 
ment of  a  vector  analysis  seemed  to  lead  logically  to  some  such 
notations.  But  I  think  that  I  can  show  that  these  notations  have 
some  substantial  advantages  over  the  quaternionic  in  point  of  con- 
venience. 

Any  linear  vector  function  of  a  variable  vector  p  may  be  expressed 
in  the  form — 

aX./>+/8/i./)+yi'./)»(aX+/8/i+yi')./)=#./), 
where  #  =  aX+$K+yy; 

or  in  quaternions 

-oSXp-j8SM/>-ySi7>-  -(aSX+0S/i+yS*)/>=  -^>, 
where  ^  =  aSX+j8S/i+ySv. 

If  we  take  the  scalar  product  of  the  vector  $./>,  and  another  vector 
<r,  we  obtain  the  scalar  quantity 

^.^/^^.(aA+zS/i+yF)./), 

or  in  quaternions  S<T<pp  =  &r (aSX + /38p + yS*)p. 

This  is  a  function  of  <r  and  of  p,  and  it  is  exactly  the  same  kind  of 
function  of  <r  that  it  is  of  />,  a  symmetry  which  is  not  so  clearly 
exhibited  in  the  quaternionic  notation  as  in  the  other.  Moreover,  we 
can  write  <r.#  for  <r.(ak +$t4+yy).  This  represents  a  vector  which 
is  a  function  of  <r,  viz.,  the  function  conjugate  to  $.<r;  and  o-.$./> 
may  be  regarded  as  the  product  of  this  vector  and  p.  This  is  not 
so  clearly  indicated  in  the  quaternionic  notation,  where  it  would  be 
straining  things  a  little  to  call  S<r<p  a  vector. 

The  combinations  aX,  /9/i,  etc.,  used  above,  are  distributive  with 
regard  to  each  of  the  two  vectors,  and  may  be  regarded  as  a  kind  of 
product.  If  we  wish  to  express  everything  in  terms  of  i,  j,  and  k,  $ 
will  appear  as  a  sum  of  ii,  ij,  ikt  ji,  jj,  jk,  ki>  kjt  kk,  each  with  a 
numerical  coefficient.  These  nine  coefficients  may  be  arranged  in  a 
square,  and  constitute  a  matrix ;  and  the  study  of  the  properties  of 
expressions  like  $  is  identical  with  the  study  of  ternary  matrices. 
This  expression  of  the  matrix  as  a  sum  of  products  (which  may  be 


Digitized  by  VjOOQIC 


160  QUATERNIONS  IN  THE  ALGEBRA  OF  VECTORS. 

extended  to  matrices  of  any  order)  affords  a  point  of  departure  from 
which  the  properties  of  matrices  may  be  deduced  with  the  utmost 
facility.  The  ordinary  matricular  product  is  expressed  by  a  dot,  as 
#.'4r.  Other  important  kinds  of  multiplication  may  be  defined  by 
the  equations — 

(aX)2G8M)-(aX0)<AXM),  (aX):(/8/i)=(a.i8)(X./i). 
With  these  definitions  £$£$:$  will  be  the  determinant  of  #,  and 
$x$  will  be  the  conjugate  of  the  reciprocal  of  #  multiplied  by  twice 
the  determinant  If  $  represents  the  manner  in  which  vectors  are 
affected  by  a  strain,  £$£$  will  represent  the  manner  in  which 
surfaces  are  affected,  and  £$£$:$'  the  manner  in  which  volumes 
are  affected.  Considerations  of  this  kind  do  not  attach  themselves  so 
naturally  to  the  notation  0=aS\+)8S/*  +  ySv,  nor  does  the  subject 
admit  so  free  a  development  with  this  notation,  principally  because 
the  symbol  S  refers  to  a  special  use  of  the  matrix,  and  is  very  much 
in  the  way  when  we  want  to  apply  the  matrix  to  other  uses,  or  to 
subject  it  to  various  operations. 
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QUATERNIONS  AND  THE  AUSDEHNUNQSLEHRE. 
[Nahure,  vol.  xuv.  pp.  79-82,  May  28,  1891.] 

The  year  1844  is  memorable  in  the  annals  of  mathematics  on  account 
of  the  first  appearance  on  the  printed  page  of  Hamilton's  Quaternions 
and  Orassmann's  Ausdehnungslehre.  The  former  appeared  in  the 
July,  October,  and  supplementary  numbers  of  the  Philosophical  Maga- 
zine, after  a  previous  communication  to  the  Royal  Irish  Academy, 
November  18,  1843.  This  communication  was  indeed  announced  to 
the  Council  of  the  Academy  four  weeks  earlier,  on  the  very  day  of 
Hamilton's  discovery  of  quaternions,  as  we  learn  from  one  of  his 
letters.  The  author  of  the  Avsdehnungdehre,  although  not  uncon- 
scious of  the  value  of  his  ideas,  seems  to  have  been  in  no  haste  to 
place  himself  on  record,  and  published  nothing  until  he  was  able 
to  give  the  world  the  most  characteristic  and  fundamental  part  of 
his  system  with  considerable  development  in  a  treatise  of  more  than 
300  pages,  which  appeared  in  August  1844. 

The  doctrine  of  quaternions  has  won  a  conspicuous  place  among  the 
various  branches  of  mathematics,  but  the  nature  and  scope  of  the 
AvsdehnungsUhre,  and  its  relation  to  quaternions,  seem  to  be  still 
the  subject  of  serious  misapprehension  in  quarters  where  we  naturally 
look  for  accurate  information.  Historical  justice,  and  the  interests  of 
mathematical  science,  seem  to  require  that  the  allusions  to  the  Auadehn- 
ungdehre  in  the  article  on  "Quaternions"  in  the  last  edition  of  the 
Encyclopcedia  Britannica,  and  in  the  third  edition  of  Prof.  Tait's 
Treatise  on  tycternione,  should  not  be  allowed  to  pass  without  protest 

It  is  principally  as  systems  of  geometrical  algebra  that  quaternions 
and  the  Ausdehnungslehre  come  into  comparison.  To  appreciate  the 
relations  of  the  two  systems,  I  do  not  see  how  we  can  proceed  better 
than  if  we  ask  first  what  they  have  in  common,  then  what  either 
system  possesses  which  is  peculiar  to  itself.  The  relative  extent  and 
importance  of  the  three  fields,  that  which  is  common  to  the  two 
systems,  and  those  which  are  peculiar  to  each,  will  determine  the 
relative  rank  of  the  geometrical  algebras.  Questions  of  priority  can 
only  relate  to  the  field  common  to  both,  and  will  be  much  simplified 
by  having  the  limits  of  that  field  clearly  drawn. 
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Geometrical  addition  in  three  dimensions  is  common  to  the  two 
systems,  and  seems  to  have  been  discovered  independently  both  by 
Hamilton  and  Grassmann,  as  well  as  by  several  other  persons  about 
the  same  time.  It  is  not  probable  that  any  especial  claim  for  priority 
with  respect  to  this  principle  will  be  urged  for  either  of  the  two  with 
which  we  are  now  concerned. 

The  functions  of  two  vectors  which  are  represented  in  quaternions 
by  Sa/3  and  Va£  are  common  to  both  systems  as  published  in  1844, 
but  the  quaternion  is  peculiar  to  Hamilton's.  The  linear  vector 
function  is  common  to  both  systems  as  ultimately  developed,  although 
mentioned  only  by  Grassmann  as  early  as  1844. 

To  those  already  acquainted  with  quaternions,  the  first  question  will 
naturally  be :  To  what  extent  are  the  geometrical  methods  which  are 
usually  called  quaternionic  peculiar  to  Hamilton,  and  to  what  extent 
are  they  common  to  Grassmann  ?  This  is  a  question  which  anyone 
can  easily  decide  for  himself.  It  is  only  necessary  to  run  one's  eye 
over  the  equations  used  by  quaternionic  writers  in  the  discussion  of 
geometrical  or  physical  subjects,  and  see  how  far  they  necessarily 
involve  the  idea  of  the  quaternion,  and  how  far  they  would  be  in- 
telligible to  one  understanding  the  functions  Sa/3  and  Va/3,  but  having 
no  conception  of  the  quaternion  aft,  or  at  least  could  be  made  so  by 
trifling  changes  of  notation,  as  by  writing  S  or  V  in  places  where 
they  would  not  affect  the  value  of  the  expressions.  For  such  a  test 
the  examples  and  illustrations  in  treatises  on  quaternions  would  be 
manifestly  inappropriate,  so  far  as  they  are  chosen  to  illustrate 
quaternionic  principles,  since  the  object  may  influence  the  form  of 
presentation.  But  we  may  use  any  discussion  of  geometrical  or 
physical  subjects,  where  the  writer  is  free  to  choose  the  form  most 
suitable  to  the  subject.  I  myself  have  used  the  chapters  and  sections 
in  Prof.  Tait's  QtLaternions  on  the  following  subjects :  Geometry  of 
the  straight  line  and  plane,  the  sphere  and  cyclic  cone,  surfaces  of 
the  second  degree,  geometry  of  curves  and  surfaces,  kinematics,  statics 
and  kinetics  of  a  rigid  system,  special  kinetic  problems,  geometrical 
and  physical  optics,  electrodynamics,  general  expressions  for  the  action 
between  linear  elements,  application  of  V  to  certain  physical  analogies, 
pp.  160-371,  except  the  examples  (not  worked  out)  at  the  close  of  the 
chapters. 

Such  an  examination  will  show  that  for  the  most  part  the  methods 
of  representing  spatial  relations  used  by  quaternionic  writers  are 
common  to  the  systems  of  Hamilton  and  Grassmann.  To  an  extent 
comparatively  limited,  cases  will  be  found  in  which  the  quaternionic 
idea  forms  an  essential  element  in  the  signification  of  the  equations. 

The  question  will  then  arise  with  respect  to  the  comparatively 
limited  field  which  is  the  peculiar  property  of  Hamilton,  How  im- 
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portant  are  the  advantages  to  be  gained  by  the  use  of  the  quaternion  ? 
This  question,  unlike  the  preceding,  is  one  into  which  a  personal 
equation  will  necessarily  enter.  Everyone  will  naturally  prefer  the 
methods  with  which  he  is  most  familiar ;  but  I  think  that  it  may  be 
saf ely  affirmed  that  in  the  majority  of  cases  in  this  field  the  advantage 
derived  from  the  use  of  the  quaternion  is  either  doubtful  or  very 
trifling.  There  remains  a  residuum  of  cases  in  which  a  substantial 
advantage  is  gained  by 'the  use  of  the  quaternionic  method.  Such 
cases,  however,  so  far  as  my  own  observation  and  experience  extend, 
are  very  exceptional  If  a  more  extended  and  careful  inquiry  should 
show  that  they  are  ten  times  as  numerous  as  I  have  found  them,  they 
would  still  be  exceptional 

We  have  now  to  inquire  what  we  find  in  the  Ausdehntmgslebre  in 
the  way  of  a  geometrical  algebra,  that  is  wanting  in  quaternions.  In 
addition  to  an  algebra  of  vectors,  the  Auadehnwngslehre  affords  a 
system  of  geometrical  algebra  in  which  the  point  is  the  fundamental 
element,  and  which  for  convenience  I  shall  call  Grassmann's  algebra 
of  points.  In  this  algebra  we  have  first  the  addition  of  points,  or 
quantities  located  at  points,  which  may  be  explained  as  follows.  The 
equation  aA+6B+cC+  etc.=*E+/F+  etc., 

in  which  the  capitals  denote  points,  and  the  small  letters  scalars  (or 
ordinary  algebraic  quantities),  signifies  that 

a+b+c+  etc.*=e+/+  etc., 

and  also  that  the  centre  of  gravity  of  the  weights  a,  6,  c,  etc,  at  the 
points  A,  B,  C,  etc.,  is  the  same  as  that  of  the  weights  e,  /,  eta,  at 
the  points  E,  F,  etc.  (It  will  be  understood  that  negative  weights  are 
allowed  as  well  as  positive.)  The  equation  is  thus  equivalent  to  four 
equations  of  ordinary  algebra.  In  this  Grassmann  was  anticipated  by 
Mobius  (Ba^centri6cher  Ccticvl,  1827). 

We  have  next  the  addition  of  finite  straight  lines,  or  quantities 
located  in  straight  lines  (Liniengrdssen).  The  meaning  of  the 
equation  AB+CD+  etc=EF+GH+  etc. 

will  perhaps  be  understood  most  readily,  if  we  suppose  that  each 
member  represents  a  system  of  forces  acting  on  a  rigid  body.  The 
equation  then  signifies  that  the  two  systems  are  equivalent.  An 
equation  of  this  form  is  therefore  equivalent  to  six  ordinary  equations. 
It  will  be  observed  that  the  IAniengrossen  AB  and  CD  are  not  simply 
vectors ;  they  have  not  merely  length  and  direction,  but  they  are  also 
located  each  in  a  given  line,  although  their  position  within  those 
lines  is  immaterial.  In  Clifford's  terminology,  AB  is  a  rotor,  AB+CD 
a  motor.  In  the  language  of  Prof.  Ball's  Theory  of  Screws,  AB+CD 
represents  either  a  twist  or  a  wrench. 
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We  have  next  the  addition  of  plane  surfaces  (Plcmgrdssen\  The 
equation  ABC+DEF+GHI=JEL 

signifies  that  the  plane  JEL  passes  through  the  point  common  to  the 
planes  ABC,  DEF,  and  GHI,  and  that  the  projection  by  parallel  lines 
of  the  triangle  JEL  on  any  plane  is  equal  to  the  sum  of  the  pro- 
jections of  ABC,  DEF,  and  GHI  on  the  same  plane.,  the  areas  being 
taken  positively  or  negatively  according  to  the  cyclic  order  of  the 
projected  points.  This  makes  the  equation  equivalent  to  four  ordinary 
equations. 

Finally,  we  have  the  addition  of  volumes,  as  in  the  equation 
ABCD+EFGH  =  IJEL, 
where  there  is  nothing  peculiar,  except  that  each  term  represents  the 
six-fold  volume  of  the  tetrahedron,  and  is  to  be  taken  positively  or 
negatively  according  to  the  relative  position  of  the  points. 

We  have  also  multiplications  as  follows:  The  line  (lAniengrdsse) 
AB  is  regarded  as  the  product  of  the  points  A  and  B.  The  Plcmgrosse 
ABC,  which  represents  the  double  area  of  the  triangle,  is  regarded  as 
the  product  of  the  three  points  A,  B,  and  C,  or  as  the  product  of  the 
line  AB  and  the  point  C,  or  of  BC  and  A,  or  indeed  of  BA  and  C. 
The  volume  ABCD,  which  represents  six  times  the  tetrahedron,  is 
regarded  as  the  product  of  the  points  A,  B,  C,  and  D,  or  as  the 
product  of  the  point  A  and  the  Plangrdsse  BCD,  or  as  the  product  of 
the  lines  AB  and  BC,  etc.,  etc 

This  does  not  exhaust  the  wealth  of  multiplicative  relations  which 
Grassinann  has  found  in  the  very  elements  of  geometry.  The 
following  products  are  called  regressive,  as  distinguished  from  the 
progressive,  which  have  been  described.  The  product  of  the  PUm- 
grdssen  ABC  and  DEF  is  a  part  of  the  line  in  which  the  planes  ABC 
and  DEF  intersect,  which  is  equal  in  numerical  value  to  the  product 
of  the  double  areas  of  the  triangles  ABC  and  DEF  multiplied  by  the 
sine  of.  the  angle  made  by  the  planes.  The  product  of  the  Linien- 
grosse  AB  and  the  Plangrdsse  CDE  is  the  point  of  intersection  of  the 
line  and  the  plane  with  a  numerical  coefficient  representing  the 
product  of  the  length  of  the  line  and  the  double  area  of  the  triangle 
multiplied  by  the  sine  of  the  angle  made  by  the  line  and  the  plane 
The  product  of  three  Plangrossen  is  consequently  the  point  common 
to  the  three  planes  with  a  certain  numerical  coefficient.  In  plane 
geometry  we  have  a  regressive  product  of  two  Liniengrdssen,  which 
gives  the  point  of  intersection  of  the  lines  with  a  certain  numerical 
coefficient. 

The  fundamental  operations  relating  to  the  point,  line,  and  plane 
are  thus  translated  into  analysis  by  multiplications.  The  immense 
flexibility  and  power  of  such  an  analysis  will  be  appreciated  by 
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anyone  who  considers  what  generalized  multiplication  in  connection 
with  additive  relations  has  done  in  other  fields,  as  in  quaternions,  or 
in  the  theory  of  matrices,  or  in  the  algebra  of  logic.  For  a  single 
example,  if  we  multiply  the  equation 

AB+CD+eta  =  EF+GH+etc 
by  PQ  (P  and  Q  being  any  two  points),  we  have 

AJBPQ+CDPQ+etc. «  EFPQ+GHPQ+etc, 
which  will  be  recognised  as  expressing  an  important  theorem  of 
statics. 

The  field  in  which  Grassmann's  algebra  of  points,  as  distinguished 
from  his  algebra  of  vectors,  finds  its  especial  application  and  utility 
is  nearly  coincident  with  that  in  which,  when  we  use  the  methods 
of  ordinary  algebra,  tetrahedral  or  anharmonic  coordinates  are  more 
appropriate  than  rectilinear.  In  fact,  Grassmann's  algebra  of  points 
may  be  regarded  as  the  application  of  the  methods  of  multiple  algebra 
to  the  notions  connected  with  tetrahedral  coordinates,  just  as  his  or 
Hamilton's  algebra  of  vectors  may  be  regarded  as  the  application  of 
the  methods  of  multiple  algebra  to  the  notions  connected  with  recti- 
linear coordinates.  These  methods,  however,  enrich  the  field  to 
which  they  are  applied  with  new  notions.  Thus  the  notion  of  the 
coordinates  of  a  line  in  space,  subsequently  introduced  by  Plucker, 
was  first  given  in  the  Auadehnungdehre  of  1844.  It  should  also  be 
observed  that  the  utility  of  a  multiple  algebra  when  it  takes  the  place 
of  an  ordinary  algebra  of  four  coordinates,  is  very  much  greater 
than  when  it  takes  the  place  of  three  coordinates,  for  the  same  reason 
that  a  multiple  algebra  taking  the  place  of  three  coordinates  is  very 
much  more  useful  than  one  taking  the  place  of  two.  Grassmann's 
algebra  of  points  will  always  command  the  admiration  of  geometers 
and  analysts,  and  furnishes  an  instrument  of  marvellous  power  to  the 
former,  and  in  its  general  form,  as  applicable  to  space  of  any  number 
of  dimensions,  to  the  latter.  To  the  physicist  an  algebra  of  points  is 
by  no  means  so  indispensable  an  instrument  as  an  algebra  of  vectora 

Grassmann's  algebra  of  vectors,  which  we  have  described  as  co- 
incident with  a  part  of  Hamilton's  system,  is  not  really  anything 
separate  from  his  algebra  of  points,  but  constitutes  a  part  of  it,  the 
vector  arising  when  one  point  is  subtracted  from  another.  Yet  it 
constitutes  a  whole,  complete  in  itself,  and  we  may  separate  it  from 
the  larger  system  to  facilitate  comparison  with  the  methods  of 
Hamilton. 

We  have,  then,  as  geometrical  algebras  published  in  1844,  an 
algebra  of  vectors  common  to  Hamilton  and  Grassmann,  augmented 
on  Hamilton's  side  by  the  quaternion,  and  on  Grassmann's  by 
his  algebra  of  points.     This  statement  should  be  made  with  the 
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reservation  that  the  addition  both  of  vectors  and  of  points  had 
been  given  by  earlier  writers. 

In  both  systems  as  finally  developed  we  have  the  linear  vector 
function,  the  theory  of  which  is  identical  with  that  of  strains  and 
rotations.  In  Hamilton's  system  we  have  also  the  linear  quaternion 
function,  and  in  Orassmann's  the  linear  function  applied  to  the 
quantities  of  his  algebra  of  points.  This  application  gives  those 
transformations  in  which  projective  properties  are  preserved,  the 
doctrine  of  reciprocal  figures  or  principle  of  duality,  etc.  (Orass- 
mann's theory  of  the  linear  function  is,  indeed,  broader  than  this, 
being  coextensive  with  the  theory  of  matrices;  but  we  are  here 
considering  only  the  geometrical  side  of  the  theory.) 

In  his  earliest  writings  on  quaternions,  Hamilton  does  not  discuss 
the  linear  function.  In  his  Lectures  on  Quaternions  (1853),  he  treats 
of  the  inversion  of  the  linear  vector  function,  as  also  of  the  linear 
quaternion  function,  and  shows  how  to  find  the  latent  roots  of  the 
vector  function,  with  the  corresponding  axes  for  the  case  of  real  and 
unequal  roots.  He  also  gives  a  remarkable  equation,  the  symbolic 
cubic,  which  the  functional  symbol  must  satisfy.  This  equation  is 
a  particular  case  of  that  which  is  given  in  Prof.  Cayley's  classical 
Memoir  on  the  Theory  of  Matrices  (1858),  and  which  is  called  by 
Prof.  Sylvester  the  Hamilton-  Cayley  equation.  In  his  Elements  of 
Quaternions  (1866),  Hamilton  extends  the  symbolic  equation  to  the 
quaternion  function. 

In  Qrassmann,  although  the  linear  function  is  mentioned  in  the 
first  Ausdehnungdehre,  we  do  not  find  so  full  a  discussion  of  the 
subject  until  the  second  AusdehnungsUhre  (1862),  where  he  discusses 
the  latent  roots  and  axes,  or  what  corresponds  to  axes  in  the  general 
theory,  the  whole  discussion  relating  to  matrices  of  any  order.  The 
more  difficult  cases  are  included,  as  that  of  a  strain  in  which  all 
the  roots  are  real,  but  there  is  only  one  axis  or  unchanged  direction. 
On  the  formal  side  he  shows  how  a  linear  function  may  be  repre- 
sented by  a  quotient  or  sum  of  quotients,  and  by  a  sum  of  products, 
Likkenavsd/ruck. 

More  important,  perhaps,  than  the  question  when  this  or  that 
theorem  was  first  published  is  the  question  where  we  first  find  those 
notions  and  notations  which  give  the  key  to  the  algebra  of  linear 
functions,  or  the  algebra  of  matrices,  as  it  is  now  generally  called. 
In  voL  xxxi,  p.  35,  of  Nature,  Prof.  Sylvester  speaks  of  Cayley's 
u ever-memorable "  Memoir  on  Matrices  as  constituting  "a  second 
birth  of  Algebra,  its  avatar  in  a  new  and  glorified  form,"  and  refers 
to  a  passage  in  his  Lectures  on  Universal  Algebra,  from  which,  I 
think,  we  are  justified  in  inferring  that  this  characterization  of  the 
memoir  is  largely  due  to  the  fact  that  it  is  there  shown  how  matrices 
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may  be  treated  as  extensive  quantities,  capable  of  addition  as  well  as 
of  multiplication.  This  idea,  however,  is  older  than  the  memoir  of 
1858.  The  iAlckenausdruck,  by  which  the  matrix  is  expressed  as  a 
sum  of  a  kind  of  products  (l&ckenhaltig,  or  open),  is  described  in  a  note 
at  the  end  of  the  first  Ausdehnungslehre.  There  we  have  the  matrix 
given  not  only  as  a  sum,  but  as  a  sum  of  products,  introducing  a 
multiplicative  relation  entirely  different  from  the  ordinary  multipli- 
cation of  matrices,  and  hardly  less  fruitful,  but  not  lying  nearly  so 
near  the  surface  as  the  relations  to  which  Prof.  Sylvester  refers.  The 
key  to  the  theory  of  matrices  is  certainly  given  in  the  first  Ausdehn- 
ungslehre, and  if  we  call  the  birth  of  matricular  analysis  the  second 
birth  of  algebra,  we  can  give  no  later  date  to  this  event  than  the 
memorable  year  of  1844. 

The  immediate  occasion  of  this  communication  is  the  following 
passage  in  the  preface  to  the  third  edition  of  Prof.  Tait's  Quaternions : 

"  Hamilton  not  only  published  his  theory  complete,  the  year  before 
the  first  (and  extremely  imperfect)  sketch  of  the  Ausdehnungslehre 
appeared ;  but  had  given  ten  years  before,  in  his  protracted  study  of 
Sets,  the  very  processes  of  external  and  internal  multiplication  (corre- 
sponding to  the  Vector  and  Scalar  parts  of  a  product  of  two  vectors) 
which  have  been  put  forward  as  specially  the  property  of  Grassmann." 

For  additional  information  we  are  referred  to  art  "Quaternions," 
Encyc.  Brit.,  where  we  read  respecting  the  first  Ausdehnungslehre : 

"  In  particular  two  species  of  multiplication  ('  inner '  and  '  outer ')  of 
directed  lines  in  one  plane  were  given.  The  results  of  these  two 
kinds  of  multiplication  correspond  respectively  to  the  numerical  and 
the  directed  parts  of  Hamilton's  quaternion  product.  But  Grassmann 
distinctly  states  in  his  preface  that  he  had  not  had  leisure  to  extend 
his  method  to  angles  in  space.  .  .  .  But  his  claims,  however  great 
they  may  be,  can  in  no  way  conflict  with  those  of  Hamilton,  whose 
mode  of  multiplying  couples  (in  which  the  '  inner '  and  '  outer '  multi- 
plication are  essentially  involved)  was  produced  in  1888,  and  whose 
quaternion  system  was  completed  and  published  before  Grassmann  had 
elaborated  for  press  even  the  rudimentary  portions  of  his  own  system, 
in  which  the  veritable  difficulty  of  the  whole  subject,  the  application 
to  angles  in  space,  had  not  even  been  attacked." 

I  shall  leave  the  reader  to  judge  of  the  accuracy  of  the  general 
terms  used  in  these  passages  in  comparing  the  first  AusdehnungsUhre 
with  Hamilton's  system  as  published  in  1843  or  1844.  The  specific 
statements  respecting  Hamilton  and  Grassmann  require  an  answer. 

It  must  be  Hamilton's  Theory  of  Conjugate  Functions  or  Algebraic 
Cowples  (read  to  the  Royal  Irish  Academy,  1883  and  1886,  and 
published  in  vol.  xvii  of  the  Transactions)  to  which  reference  is  made 
in  the  statements  concerning  his  "protracted  study  of  Sets"  and 
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"mode  of  multiplying  couples*'  But  I  cannot  find  anything  like 
Grassmann's  external  or  internal  multiplication  in  this  memoir,  which 
is  concerned,  as  the  title  pretty  clearly  indicates,  with  the  theory  of 
the  complex  quantities  of  ordinary  algebra. 

It  is  difficult  to  understand  the  statements  respecting  the  Ausdehn- 
ungslehre,  which  seem  to  imply  that  Grassmann's  two  kinds  of 
multiplication  were  subject  to  some  kind  of  limitation  to  a  plane. 
The  external  product  is  not  limited  in  the  first  Auadehnungstehre  even 
to  three  dimensions.  The  internal,  which  is  a  comparatively  simple 
matter,  is  mentioned  in  the  first  Ausdehnungslehre  only  in  the  preface, 
where  it  i»  defined,  and  placed  beside  the  external  product  as  relating 
to  directed  lines.  There  is  not  the  least  suggestion  of  any  difference 
in  the  products  in  respect  to  the  generality  of  their  application  to 
vectors. 

The  misunderstanding  seems  to  have  arisen  from  the  following 
sentence  in  Grassmann's  preface :  "  And  in  general,  in  the  consideration 
of  angles  in  space,  difficulties  present  themselves,  for  the  complete 
(aUseitig)  solution  of  which  I  have  not  yet  had  sufficient  leisure." 
It  is  not  surprising  that  Grassmann  should  have  required  more  time 
for  the  development  of  some  parts  of  his  system,  when  we  consider 
that  Hamilton,  on  his  discovery  of  quaternions,  estimated  the  time 
which  he  should  wish  to  devote  to  them  at  ten  or  fifteen  years  (see 
his  letter  to  Prof.  Tait  in  the  North  British  Review  for  September 
1866),  and  actually  took  several  years  to  prepare  for  the  press  as 
many  pages  as  Grassmann  had  printed  in  1844.  But  any  speculation 
as  to  the  questions  which  Grassmann  may  have  had  principally  in 
mind  in  the  sentence  quoted,  and  the  particular  nature  of  the  diffi- 
culties which  he  found  in  them,  however  interesting  from  other  points 
of  view,  seems  a  very  precarious  foundation  for  a  comparison  of  the 
systems  of  Hamilton  and  Grassmann  as  published  in  the  years 
1843-44.  Such  a  comparison  should  be  based  on  the  positive  evidence 
of  doctrines  and  methods  actually  published. 

Such  a  comparison  I  have  endeavoured  to  make,  or  rather  to 
indicate  the  basis  on  which  it  may  be  made,  so  far  as  systems  of 
geometrical  algebra  are  concerned.  As  a  contribution  to  analysis  in 
general,  I  suppose  that  there  is  no  question  that  Grassmann's  system 
is  of  indefinitely  greater  extension,  having  no  limitation  to  any 
particular  number  of  dimensions. 
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QUATERNIONS  AND  THE  ALGEBRA  OF  VECTORS. 
[Nature,  voL  xlvu.  pp.  463,  464,  Mar.  16,  1893.] 

In  a  recent  number  of  Natwre  [vol.  xlvii,  p.  151],  Mr.  McAulay 
puts  certain  questions  to  Mr.  Heaviside  and  to  me,  relating  to  a 
subject  of  such  importance  as  to  justify  an  answer  somewhat  at 
length.  I  cannot  of  course  speak  for  Mr.  Heaviside,  although  I 
suppose  that  his  views  are  not  very  different  from  mine  on  the  most 
essential  points,  but  even  if  he  shall  have  already  replied  before  this 
letter  can  appear,  I  shall  be  glad  to  add  whatever  of  force  may  belong 
to  independent  testimony. 

Mr.  McAulay  asks :  "  What  is  the  first  duty  of  the  physical  vector 
analyst  qud  physical  vector  analyst  ? "  The  answer  is  not  doubtful 
It  is  to  present  the  subject  in  such  a  form  as  to  be  most  easily 
acquired,  and  most  useful  when  acquired. 

In  regard  to  the  slow  progress  of  such  methods  towards  recognition 
and  use  by  physicists  and  others,  which  Mr.  McAulay  deplores,  it  does 
not  seem  possible  to  impute  it  to  any  want  of  uniformity  of  notation. 
I  doubt  whether  there  is  any  modern  branch  of  mathematics  which 
has  been  presented  for  so  long  a  time  with  a  greater  uniformity  of 
notation  than  quaternions. 

What,  then,  is  the  cause  of  the  fact  which  Mr.  McAulay  and  all 
of  us  deplore  ?  It  is  not  far  to  seek.  We  need  only  a  glance  at  the 
volumes  in  which  Hamilton  set  forth  his  method.  No  wonder  that 
physicists  and  others  failed  to  perceive  the  possibilities  of  simplicity, 
perspicuity,  and  brevity  which  were  contained  in  a  system  presented 
to  them  in  ponderous  volumes  of  800  pages.  Perhaps  Hamilton  may 
have  intended  these  volumes  as  a  sort  of  thesaurus,  and  we  should 
look  to  his  shorter  papers  for  a  compact  account  of  his  method.  But 
if  we  turn  to  his  earlier  papers  on  Quaternions  in  the  Philosophical 
Magazine,  in  which  principally  he  introduced  the  subject  to  the 
notice  of  his  contemporaries,  we  find  them  entitled  "  On  Quaternions; 
or  on  a  New  System  of  Imaginaries  in  Algebra,"  and  in  them  we 
find  a  great  deal  about  imaginaries,  and  very  little  of  a  vector 
analysis.  To  show  how  slowly  the  system  of  vector  analysis 
developed  itself  in  the  quaternionic  nidus,  we  need  only  say  that 
the  symbols  S,  V,  and  V  do  not  appear  until  two  or  three  years 
after  the  discovery  of  quaternions.     In  short,  it  seems  to  have  been 
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only  a  secondary  object  with  Hamilton  to  express  the  geometrical 
relations  of  vectors, — secondary  in  time,  and  also  secondary  in  this, 
that  it  was  never  allowed  to  give  shape  to  his  work. 

But  this  relates  to  the  past  In  regard  to  the  present  status,  I  beg 
leave  to  quote  what  Mr.  McAulay  has  said  on  another  occasion  (see 
Phil.  Mag.,  June  1892): — "Quaternions  differ  in  an  important  respect 
from  other  branches  of  mathematics  that  are  studied  by  mathe- 
maticians after  they  have  in  the  course  of  years  of  hard  labour  laid 
the  foundation  of  all  their  future  work.  In  nearly  all  cases  these 
branches  are  very  properly  so  called.  They  each  grow  out  of  a 
definite  spot  of  the  main  tree  of  mathematics,  and  derive  their 
sustenance  from  the  sap  of  the  trunk  as  a  whole.  But  not  so  with 
quaternions.  To  let  these  grow  in  the  brain  of  a  mathematician, 
he  must  start  from  the  seed  as  with  the  rest  of  his  mathematics 
regarded  as  a  whole.  He  cannot  graft  them  on  his  already  flourishing 
tree,  for  they  will  die  there.  They  are  independent  plants  that 
require  separate  sowing  and  the  consequent  careful  tending." 

Can  wo  wonder  that  mathematicians,  physicists,  astronomers,  and 
geometers  feel  some  doubt  as  to  the  value  or  necessity  of  something 
so  separate  from  all  other  branches  of  learning?  Can  that  be  a 
natural  treatment  of  the  subject  which  has  no  relations  to  any  other 
method,  and,  as  one  might  suppose  from  reading  some  treatises,  has 
only  occurred  to  a  single  man  ?  Or,  at  best,  is  it  not  discouraging 
to  be  told  that  in  order  to  use  the  quaternionic  method,  one  must 
give  up  the  progress  which  he  has  already  made  in  the  pursuit  of 
his  favourite  science,  and  go  back  to  the  beginning  and  start  anew 
on  a  parallel  course  ? 

1  believe,  however,  that  if  what  I  have  quoted  is  true  of  vector 
methods,  it  is  because  there  is  something  fundamentally  wrong  in 
the  presentation  of  the  subject  Of  course,  in  some  sense  and  to  some 
extent  it  is  and  must  be  true  Whatever  is  special,  accidental,  and 
individual,  will  die,  as  it  should;  but  that  which  is  universal  and 
essential  should  remain  as  an  organic  part  of  the  whole  intellectual 
acquisition.  If  that  which  is  essential  dies  with  the  accidental,  it 
must  be  because  the  accidental  has  been  given  the  prominence  which 
belongs  to  the  essential  For  myself,  I  should  preach  no  such  doctrine 
to  those  whom  I  wish  to  convert  to  the  true  faith. 

In  Italy,  they  say,  all  roads  lead  to  Borne  In  mechanics,  kine- 
matics, astronomy,  physics,  all  study  leads  to  the  consideration  of 
certain  relations  and  operations.  These  are  the  capital  notions ;  these 
should  have  the  leading  parts  in  any  analysis  suited  to  the  subject 

If  I  wished  to  attract  the  student  of  any  of  these  sciences  to  an 
algebra  for  vectors,  I  should  tell  him  that  the  fundamental  notions 
of  this  algebra  were  exactly  those  with  which  he  was  daily  con- 
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versant.  I  should  tell  him  that  a  vector  algebra  is  so  far  from  being 
any  one  man's  production  that  half  a  century  ago  several  were  already 
working  toward  an  algebra  which  should  be  primarily  geometrical  and 
not  arithmetical,  and  that  there  is  a  remarkable  similarity  in  the  results 
to  which  these  efforts  led  (see  Proc.  A.A.A.S.  for  1886,  pp.  37,  ff )  [this 
vol  p.  91,  ff.].  I  should  call  his  attention  to  the  fact  that  Lagrange 
and  Gauss  used  the  notation  (afty)  to  denote  precisely  the  same  as 
Hamilton  by  his  S(a£y),  except  that  Lagrange  limited  the  expression 
to  unit  vectors,  and  Gauss  to  vectors  of  which  the  length  is  the  secant 
of  the  latitude,  and  I  should  show  him  that  we  have  only  to  give  up 
these  limitations,  and  the  expression  (in  connection  with  the  notion 
of  geometrical  addition)  is  endowed  with  an  immense  wealth  of 
transformations.  I  should  call  his  attention  to  the  fact  that  the 
notation  [r^J,  universal  in  the  theory  of  orbits,  is  identical  with 
Hamilton's  V(p1/>2),  except  that  Hamilton  takes  the  area  as  a  vector, 
i.e.,  includes  the  notion  of  the  direction  of  the  normal  to  the  plane  of 
the  triangle,  and  that  with  this  simple  modification  (and  with  the 
notion  of  geometrical  addition  of  surfaces  as  well  as  of  lines)  this 
expression  becomes  closely  connected  with  the  first-mentioned,  and  is 
not  only  endowed  with  a  similar  capability  for  transformation,  but 
enriches  the  first  with  new  capabilities.  In  fact,  I  should  tell  him 
that  the  notions  which  we  use  in  vector  analysis  are  those  which  he 
who  reads  between  the  lines  will  meet  on  every  page  of  the  great 
masters  of  analysis,  or  of  those  who  have  probed  deepest  the  secrets  of 
nature,  the  only  difference  being  that  the  vector  analyst,  having  regard 
to  the  weakness  of  the  human  intellect,  does  as  the  early  painters 
who  wrote  beneath  their  pictures  *'  This  is  a  tree,"  "  This  is  a  horse." 

I  cannot  attach  quite  so  much  importance  as  Mr.  McAulay  to 
uniformity  of  notation.  That  very  uniformity,  if  it  existed  among 
those  who  use  a  vector  analysis,  would  rather  obscure  than  reveal 
their  connection  with  the  general  course  of  modern  thought  in 
mathematics  and  physics.  There  are  two  ways  in  which  we  may 
measure  the  progress  of  any  reform.  The  one  consists  in  counting 
those  who  have  adopted  the  shibboleth  of  the  reformers;  the  other 
measure  is  the  degree  in  which  the  community  is  imbued  with  the 
essential  principles  of  the  reform.  I  should  apply  the  broader 
measure  to  the  present  case,  and  do  not  find  it  quite  so  bad  as 
Mr.  McAulay  does. 

Tet  the  question  of  notations,  although  not  the  vital  question,  is 
certainly  important,  and  I  assure  Mr.  McAulay  that  reluctance  to 
make  unnecessary  innovations  in  notation  has  been  a  very  powerful 
motive  in  restraining  me  from  publication.  Indeed  my  pamphlet  on 
Vector  Analysis,  which  has  excited  the  animadversion  of  quater- 
nionists,   was   never   formally   published,    although    rather    widely 
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distributed,  so  long  as  I  had  copies  to  distribute,  among  those  who 
I  thought  might  be  interested  in  the  subject.  I  may  say,  however, 
since  I  am  called  upon  to  defend  my  position,  that  I  have  found 
the  notations  of  that  pamphlet  more  flexible  than  those  generally 
used.  Mr.  McAulay,  at  least,  will  understand  what  I  mean  by  this, 
if  I  say  that  some  of  the  relations  which  he  has  thought  of  sufficient 
importance  to  express  by  means  of  special  devices  (see  Proc.  RJ3.E. 
for  1890-91),  may  be  expressed  at  least  as  briefly  in  the  notations 
which  I  have  used,  and  without  special  devices.  But  I  should  not 
have  been  satisfied  for  the  purposes  of  my  pamphlet  with  any 
notation  which  should  suggest  even  to  the  careless  reader  any  con- 
nection with  the  notion  of  the  quaternion.  For  I  confess  that  one 
of  my  objects  was  to  show  that  a  system  of  vector  analysis  does  not 
require  any  support  from  the  notion  of  the  quaternion,  or,  I  may 
add,  of  the  imaginary  in  algebra. 

I  should  hardly  dare  to  express  myself  with  so  much  freedom,  if 
I  could  not  shelter  myself  behind  an  authority  which  will  not  be 


I  do  not  see  that  I  have  done  anything  very  different  from  what 
the  eminent  mathematician  upon  whom  Hamilton's  mantle  has  fallen 
has  been  doing,  it  would  seem,  unconsciously.  Contrast  the  system  of 
quaternions,  which  he  has  described  in  his  sketch  of  Hamilton's  life 
and  work  in  the  North  British  Review  for  September,  1866,  with  the 
system  which  he  urges  upon  the  attention  of  physicists  in  the  Philo- 
sophical Magazine  in  1890.  In  1866  we  have  a  great  deal  about 
imaginaries,  and  nearly  as  much  about  the  quaternion.  In  1890  we 
have  nothing  about  imaginaries,  and  little  about  the  quaternion. 
Prof.  Tait  has  spoken  of  the  calculus  of  quaternions  as  throwing  off 
in  the  course  of  years  its  early  Cartesian  trammels.  I  wonder  that 
he  does  not  see  how  well  the  progress  in  which  he  has  led  may  be 
described  as  throwing  off  the  yoke  of  the  quaternion.  A  characteristic 
example  is  seen  in  the  use  of  the -symbol  V.  Hamilton  applies  this  to 
a  vector  to  form  a  quaternion,  Tait  to  form  a  linear  vector  function. 
But  while  breathing  a  new  life  into  the  formulae  of  quaternions, 
Prof.  Tait  stands  stoutly  by  the  letter. 

Now  I  appreciate  and  admire  the  generous  loyalty  toward  one 
whom  he  regards  as  his  master,  which  has  always  led  Prof.  Tait  to 
minimise  the  originality  of  his  own  work  in  regard  to  quaternions, 
and  write  as  if  everything  was  contained  in  the  ideas  which  flashed 
into  the  mind  of  Hamilton  at  the  classic  Brougham  Bridge.  But  not 
to  speak  of  other  claims  of  historical  justice,  we  owe  duties  to  our 
scholars  as  well  as  to  our  teachers,  and  the  world  is  too  large,  and  the 
current  of  modern  thought  is  too  broad,  to  be  confined  by  the  ipse 
dixit  even  of  a  Hamilton. 
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QUATERNIONS  AND  VECTOR  ANALYSIS. 
[Nature,  vol.  XLVin.  pp.  364-367,  Aug.  17,  1893.] 

In  a  paper  by  Prof.  C.  Q.  Knott  on  u  Recent  Innovations  in  Vector 
Theory/1  of  which  an  abstract  has  been  given  in  Nature  (vol.  xlvii, 
pp.  590-593;  see  also  a  minor  abstract  on  p.  287),  the  doctrine  that 
the  quaternion  affords  the  only  sufficient  and  proper  basis  for  vector 
analysis  is  maintained  by  arguments  based  so  largely  on  the  faults 
and  deficiencies  which  the  author  has  found  in  my  pamphlet,  Elements 
of  Vector  Analysis,  as  to  give  to  such  faults  an  importance  which 
they  would  not  otherwise  possess,  and  to  make  some  reply  from  me 
necessary,  if  I  would  not  discredit  the  cause  of  non-quaternionic 
vector  analysis.  Especially  is  this  true  in  view  of  the  warm  com- 
mendation and  endorsement  of  the  paper,  by  Prof.  Tait,  which 
appeared  in  Nature  somewhat  earlier  (p.  225). 

The  charge  which  most  requires  a  reply  is  expressed  most  distinctly 
in  the  minor  abstract,  viz.,  "that  in  the  development  of  his  dyadic 
notation,  Prof.  Gibbe,  being  forced  to  bring  the  quaternion  in,  logically 
condemned  his  own  position."  This  was  incomprehensible  to  me  until 
I  received  the  original  paper,  where  I  found  the  charge  specified  as 
follows :  "  Although  Gibbs  gets  over  a  good  deal  of  ground  without 
the  explicit  recognition  of  the  complete  product,  which  is  the  difference 
of  his  '  skew '  and  '  direct '  products,  yet  even  he  recognises  in  plain 
language  the  versorial  character  of  a  vector,  brings  in  the  quaternion 
whose  vector  is  the  difference  of  a  linear  vector  function  and  its 
conjugate,  and  does  not  hesitate  to  use  the  accursed  thing  itself  in 
certain  line,  surface,  and  volume  integrals"  (Proc.  R.S.E.,  Session 
1892-3,  p.  236).  These  three  specifications  I  shall  consider  in  their 
inverse  order,  premising,  however,  that  the  epitheta  omantia  are 
entirely  my  critic's. 

The  last  charge  is  due  entirely  to  an  inadvertence.  The  integrals 
referred  to  are  those  given  at  the  close  of  the  major  abstract  in 
Nature  (p.  593).  My  critic,  in  his  original  paper,  states  quite  correctly 
that,  according  to  my  definitions  and  notations,  they  should  represent 
dyadics.  He  multiplies  them  into  a  vector,  introducing  the  vector 
under  the  integral  sign,  as  is  perfectly  proper,  provided,  of  course, 
that  the  vector  is  constant.    But  failing  to  observe  this  restriction, 
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evidently  through  inadvertence,  and  finding  that  the  resulting 
equations  (thus  interpreted)  would  not  be  true,  he  concludes  that  I 
must  have  meant  something  else  by  the  original  equations.  Now, 
these  equations  will  hold  if  interpreted  in  the  quaternionic  sense,  as  is, 
indeed,  a  necessary  consequence  of  their  holding  in  the  dyadic  sense, 
although  the  converse  would  not  be  true.  My  critic  was  thus  led, 
in  consequence  of  the  inadvertence  mentioned,  to  suppose  that  I  had 
departed  from  my  ordinary  usage  and  my  express  definitions,  and 
had  intended  the  products  in  these  integrals  to  be  taken  in  the 
quaternionic  sense.    This  is  the  sole  ground  for  the  last  charge. 

The  second  charge  evidently  relates  to  the  notations  $8  and  $x  (see 
Nature,  vol.  xlvii,  p.  592).  It  is  perfectly  true  that  I  have  used  a 
scalar  and  a  vector  connected  with  the  linear  vector  operator, 'which, 
if  combined,  would  form  a  quaternion.  I  have  not  thus  combined 
them.  Perhaps  Prof.  Knott  will  say  that  since  I  use  both  of  them  it 
matters  little  whether  I  combine  them  or  not.  If  so  I  heartily  agree 
with  him. 

The  first  charge  is  a  little  vague.  I  certainly  admit  that  vectors 
may  be  used  in  connection  with  and  to  represent  rotations  I  have  no 
objection  to  calling  them  in  such  cases  versorial.  In  that  sense 
Lagrange  and  Poinsot,  for  example,  used  versorial  vectors.  But  what 
has  this  to  do  with  quaternions?  Certainly  Lagrange  and  Poinsot 
were  not  quaternionists. 

The  passage  in  the  major  abstract  in  Nature  which  most  distinctly 
charges  me  with  the  use  of  the  quaternion  is  that  in  which  a  certain 
expression  which  I  use  is  said  to  represent  the  quaternion  operator 
q(  )q-1  (vol.  xlvii,  p.  592).  It  would  be  more  accurate  to  say  that 
my  expression  and  the  quaternionic  expression  represent  the  same 
operator.  Does  it  follow  that  I  have  used  a  quaternion  ?  Not  at  all. 
A  quaternionic  expression  may  represent  a  number.  Does  everyone 
who  uses  any  expression  for  that  number  use  quaternions?  A 
quaternionic  expression  may  represent  a  vector.  Does  everyone  who 
uses  any  expression  for  that  vector  use  quaternions  ?  A  quaternionic 
expression  may  represent  a  linear  vector  operator.  If  I  use  an 
expression  for  that  linear  vector  operator  do  I  therefore  use  quater- 
nions ?  My  critic  is  so  anxious  to  prove  that  I  use  quaternions  that 
he  uses  arguments  which  would  prove  that  quaternions  were  in 
common  use  before  Hamilton  was  born. 

So  much  for  the  alleged  use  of  the  quaternion  in  my  pamphlet. 
Let  us  now  consider  the  faults  and  deficiencies  which  have  been  found 
therein  and  attributed  to  the  want  of  the  quaternion.  The  most 
serious  criticism  in  this  respect  relates  to  certain  integrating  operators, 
which  Prof.  Tait  unites  with  Prof.  Knott  in  ridiculing.  As  definitions 
are  wearisome,  I  will  illustrate  the.  use  of  the  terms  and  notations 
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which  I  have  used  by  quoting  a  sentence  addressed  to  the  British 
Association  a  few  years  ago.    The  speaker  was  Lord  Kelvin. 

"Helmholtz  first  solved  the  problem — Given  the  spin  in  any  case 
of  liquid  motion,  to  find  the  motion.  His  solution  consists  in  finding 
the  potentials  of  three  ideal  distributions  of  gravitational  matter 
having  densities  respectively  equal  to  1/x  of  the  rectangular  com- 
ponents of  the  given  spin;  and,  regarding  for  a  moment  these 
potentials  as  rectangular  components  of  velocity  in  a  case  of  liquid 
motion,  taking  the  spin  in  this  motion  as  the  velocity  in  the  required 
motion  "  (Natwre,  vol  xxxviii,  p.  569). 

In  the  terms  and  notations  of  my  pamphlet  the  problem  and 
solution  may  be  thus  expressed : 

Given  the  curl  in  any  case  of  liquid  motion — to  find  the  motion. 

The  required  velocity  is  l/4x  of  the  curl  of  the  potential  of  the 
given  curl. 

Or,  more  briefly— The  required  velocity  is  7-  of  the  Laplacian 
of  the  given  curl. 

Or  in  purely  analytical  form — Required  a>  in  terms  of  Vx»,  when 
V.»=xO. 

Solution :         «= l/4xVxPot  Vx»= 1/4*  Lap  Vx«. 

(The  Laplacian  expresses  the  result  of  an  operation  like  that  by 
which  magnetic  force  is  calculated  from  electric  currents  distributed 
in  space.  This  corresponds  to  the  second  form  in  which  Helmholtz 
expressed  his  result.) 

To  show  the  incredible  rashness  of  my  critics,  I  will  remark  that 
these  equations  are  among  those  of  which  it  is  said  in  the  original 
paper  (Proe.  RS.E.,  Session  1892-93,  p.  225),  "Gibbs  gives  a  good 
many  equations — theorems  I  suppose  they  ape  at  being."  I  may  add 
that  others  of  the  equations  thus  characterized  are  associated  with 
names  not  less  distinguished  than  that  of  Helmholtz.  But  that  to 
which  I  wish  especially  to  call  attention  is  that  the  terms  and 
notations  in  question  express  exactly  the  notions  which  physicists 
want  to  use. 

But  we  are  told  (Natwre,  vol.  xlvii,  p.  287)  that  these  integrating 
operators  (Pot,  Lap)  are  best  expressed  as  inverse  functions  of  V. 
To  see  how  utterly  inadequate  the  Nabla  would  have  been  to  express 
the  idea,  we  have  only  to  imagine  the  exclamation  points  which  the 
members  of  the  British  Association  would  have  looked  at  each  other 
if  the  distinguished  speaker  had  said : 

Helmholtz  first  solved  the  problem — Given  the  Nabla  of  the  velocity 
in  any  case  of  liquid  motion,  to  find  the  velocity.  His  solution  was 
that  the  velocity  was  the  Nabla  of  the  inverse  square  of  Nabla  of 
the  Nabla  of  the  velocity.  Or,  that  the  velocity  was  the  inverse 
Nabla  of  the  Nabla  of  the  velocity. 
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Or,  if  the  problem  and  solution  had  been  written  thus :  Required  w 
in  terms  of  V»  when  SV«=0. 

Solution:  »=»VV-,V««V-1V«. 

My  critic  has  himself  given  more  than  one  example  of  the  unfitness 
of  the  inverse  Nabla  for  the  exact  expression  of  thought.  For 
example,  when  he  says  that  I  have  taken  "eight  distinct  steps  to 
prove  tWo  equations,  which  are  special  cases  of 

I  do  not  quite  know  what  he  means.  If  he  means  that  I  have  taken 
eight  steps  to  prove  Poisson's  Equation  (which  certainly  is  not  ex- 
pressed by  the  equation  cited,  although  it  may  perhaps  be  associated 
with  it  in  some  minds),  I  will  only  say  that  my  proof  is  not  very 
long,  especially  as  I  have  aimed  at  greater  rigor  than  is  usually 
thought  necessary.  I  cannot,  however,  compare  my  demonstration 
with  that  of  quaternionic  writers,  as  I  have  not  been  able  (doubtless 
on  account  of  insufficient  search)  to  find  any  such. 

To  show  how  little  foundation  there  is  for  the  charge  that  the 
deficiencies  of  my  system  require  to  be  pieced  out  by  these  integral 
operators,  I  need  only  say  that  if  I  wished  to  economise  operators  I 
might  give  up  New,  Lap,  and  Max,  writing  for  them  V  Pot,  VxPot, 
and  V.Pot,  and  if  I  wished  further  to  economise  in  what  costs  so  little, 
I  could  give  up  the  potential  also  by  using  the  notation  (V.V)"1  or 
V  -*.  That  is,  I  could  have  used  this  notation  without  greater  sacrifice 
of  precision  than  quaternionic  writers  seem  to  be  willing  to  make.  I 
much  prefer,  however,  to  avoid  these  inverse  operators  as  essentially 
indefinite. 

Nevertheless — although  my  critic  has  greatly  obscured  the  Bubject 
by  ridiculing  operators,  which  I  beg  leave  to  maintain  are  not  worthy 
of  ridicule,  and  by  thoughtlessly  asserting  that  it  was  necessary  for 
me  to  use  them,  whereas  they  are  only  necessary  for  me  in  the  sense 
in  which  something  of  the  kind  is  necessary  for  the  quaternionist  also, 
if  he  would  use  a  notation  irreproachable  on  the  score  of  exactness — 
I  desire  to  be  perfectly  candid.  I  do  not  wish  to  deny  that  the 
relations  connected  with  these  notations  appear  a  little  more  simple 
in  the  quaternionic  form.  I  had,  indeed,  this  subject  principally  in 
mind  when  I  said  two  years  ago  in  Natwre  (voL  xliii,  p.  512)  [this  voL 
p.  158] :  "  There  are  a  few  formula  in  which  there  is  a  trifling  gain 
in  compactness  in  the  use  of  the  quaternion.1*  Let  us  see  exactly 
how  much  this  advantage  amounts  to. 

There  is  nothing  which  the  most  rigid  quaternionist  need  object  to 
in  the  notation  for  the  potential,  or  indeed  for  the  Newtonian.  These 
represent  respectively  the  operations  by  which  the  potential  or  the 
force  of  gravitation  is  calculated  from  the  density  of  matter.  A 
quaternionist  would,  however,  apply  the  operator  New  not  only  to  a 
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scalar,  as  I  have  done,  but  to  a  vector  also.  The  vector  part  of  New  to 
(construed  in  the  quaternionic  sense)  would  be  exactly  what  I  have 
represented  by  Lap  to,  and  the  scalar  part,  taken  negatively,  would  be 
exactly  what  I  have  represented  by  Maxco.  The  quaternionist  has 
here  a  slight  economy  in  notations,  which  is  of  less  importance,  since 
all  the  operators — New,  Lap,  Max — may  be  expressed  without 
ambiguity  in  terms  of  the  potential,  which  is  therefore  the  only  one 
necessary  for  the  exact  expression  of  thought. 

But  what  are  the  formulae  which  it  is  necessary  for  one  to  remember 
who  uses  my  notations?  Evidently  only  those  which  contain  the 
operator  Pot.  For  all  the  others  are  derived  from  these  by  the  simple 
substitutions  Jfew — v  Pot, 

Lap  =VxPot, 

Max=V.Pot 
Whether  one  is  quaternionist  or  not,  one  must  remember  Poisson's 
Equation,  which  I  write 

?.VPota>=-4x<«>, 
and  in  quaternionic  might  be  written 

V*Pot«  =  4?ra>. 
If  to  is  a  vector,  in  using  my  equations  one  has  also  to  remember  the 
general  formulae, 

V.V«=VV.a>-VxVxa> 

which  as  applied  to  the  present  case  may  be  united  with  the  preceding 
in  the  three-membered  equation, 

V.VPot«=VV.  Pota>- VxVxPota>= -4™. 

This  single  equation  is  absolutely  all  that  there  is  to  burden  the 
memory  of  the  student,  except  that  the  symbols  of  differentiation 
(V,  Vx,  V.)  may  be  placed  indifferently  before  or  after  the  symbol 
for  the  potential,  and  that  if  we  choose  we  may  substitute  as  above 
New  for  VPot,  etc  Of  course  this  gives  a  good  many  equations, 
which  on  account  of  the  importance  of  the  subject  (as  they  might 
almost  be  said  to  give  the  mathematics  of  the  electro-magnetic  field) 
I  have  written  out  more  in  detail  than  might  seem  necessary.  I  have 
also  called  the  attention  of  the  student  to  many  things,  which  perhaps 
he  might  be  left  to  himself  to  see.  Prof.  Knott  says  that  the  quater- 
nionist obtains  similar  equations  by  the  simplest  transformations. 
He  has  failed  to  observe  that  the  same  is  true  in  my  Vector  Analysis, 
when  once  I  have  proved  Poisson's  Equation.  Perhaps  he  takes  his 
model  of  brevity  from  Prof.  Tait,  who  simplifies  the  subject,  I  believe, 
in  his  treatise  on  Quaternions,  by  taking  this  theorem  for  granted. 

Nevertheless,  since  I  am  forced  so  often  to  disagree  with  Prof. 
Knott,  I  am  glad  to  agree  with  him  when  I  can.     He  says  in  his 
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original  paper  (p.  226),  "No  finer  argument  in  favour  of  the  real 
quaternion  vector  analysis  can  be  found  than  in  the  tangle  and  the 
jangle  of  sections  91  to  104  in  the  Elements  of  Vector  Analysis." 
Now  I  am  quite  ready  to  plead  guilty  to  the  tangle.  The  sections 
mentioned,  as  is  sufficiently  evident  to  the  reader,  were  written  at 
two  different  times,  sections  102-1(34  being  an  addition  after  a  couple 
of  years.  The  matter  of  these  latter  sections  is  not  found  in  its 
natural  place,  and  the  result  is  well  enough  characterised  as  a  tangle. 
It  certainly  does  credit  to  the  conscientious  study  which  Prof.  Knott 
has  given  to  my  pamphlet,  that  he  has  discovered  that  there  is  a 
violent  dislocation  of  ideas  just  at  this  point.  For  such  a  fault  of 
composition  I  have  no  sufficient  excuse  to  offer,  but  I  must  protest 
against  its  being  made  the  ground  of  any  broad  conclusions  in  regard 
to  the  fundamental  importance  of  the  quaternion. 

Prof.  Knott  next  proceeds  to  criticise — or  at  least  to  ridicule — my 
treatment  of  the  linear  vector  function,  with  respect  to  which  we  read 
in  the  abstract : — "  As  developed  in  the  pamphlet,  the  theory  of  the 
dyadic  goes  over  much  the  same  ground  as  is  traversed  in  the  last 
chapter  of  Kelland  and  Tait's  Introduction  to  Qualernions.  With 
the  exception  of  a  few  of  those  lexicon  products,  for  which  Prof. 
Qibbs  has  such  an  affection,  there  is  nothing  of  real  value  added  to 
our  knowledge  of  the  linear  vector  function."  It  would  not,  I  think, 
be  difficult  to  show  some  inaccuracy  in  my  critic's  characterisation 
of  the  real  content  of  this  part  of  my  pamphlet.  But  as  algebra  is 
a  formal  science,  and  as  the  whole  discussion  is  concerning  the  best 
form  of  representing  certain  kinds  of  relations,  the  important  question 
would  seem  to  be  whether  there  is  anything  of  formal  value  in  my 
treatment  of  the  linear  vector  function. 

Now  Prof.  Knott  distinctly  characterises  in  half  a  dozen  words  the 
difference  in  the  spirit  and  method  of  my  treatment  of  this  subject 
from  that  which  is  traditional  among  quaternionists,  when  he  says 
of  what  I  have  called  dyadics — "these  are  not  quantities,  but  operators" 
(Nature,  vol  xlvii,  p.  592).  I  do  not  think  that  I  applied  the  word 
quantity  to  the  dyadics,  but  Prof.  Knott  recognised  that  I  treated 
them  as  quantities — not,  of  course,  as  the  quantities  of  arithmetic,  or 
of  ordinary  algebra,  but  as  quantities  in  the  broader  sense,  in  which, 
for  example,  quaternions  are  called  quantities.  The  fact  that  they 
may  be  operators  does  not  prevent  this.  Just  as  in  grammar  verbs 
may  be  taken  as  substantives,  viz.,  in  the  infinitive  mood,  so  in  algebra 
operators — especially  such  as  are  capable  of  quantitative  variation — 
may  be  regarded  as  quantities  when  they  are  made  the  subject  of 
algebraic  comparison  or  operation.  Now  I  would  not  say  that  it  is 
necessary  to  treat  every  kind  of  operator  as  quantity,  but  I  certainly 
think  that  one  so  important  as  the  linear  vector  operator,  and  one 
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which  lends  itself  so  well  to  such  broader  treatment,  is  worthy  of  it 
Of  course,  when  vectors  are  treated  by  the  methods  of  ordinary 
algebra,  linear  vector  operators  will  naturally  be  treated  by  the  same 
methods,  but  in  an  algebra  formed  for  the  sake  of  expressing  the 
relations  between  vectors,  and  in  which  vectors  are  treated  as  multiple 
quantities,  it  would  seem  an  incongruity  not  to  apply  the  methods 
of  multiple  algebra  also  to  the  linear  vector  operator. 

The  dyadic  is  practically  the  linear  vector  operator  regarded  as 
quantity.  More  exactly  it  is  the  multiple  quantity  of  the  ninth  order 
which  affords  various  operators  according  to  the  way  in  which  it  is 
applied.  I  will  not  venture  to  say  what  ought  to  be  included  in  a 
treatise  on  quaternions,  in  which,  of  course,  a  good  many  subjects 
would  have  claims  prior  to  the  linear  vector  operator;  but  for  the 
purposes  of  my  pamphlet,  in  which  the  linear  vector  operator  is  one 
of  the  most  important  topics,  I  cannot  but  regard  a  treatment  like 
that  in  Hamilton's  Lectures,  or  Elements,  as  wholly  inadequate  on 
the  formal  side.  To  show  what  I  mean,  I  have  only  to  compare 
Hamilton's  treatment  of  the  quaternion  and  of  the  linear  vector 
operator  with  respect  to  notations.  Since  quaternions  have  been 
identified  with  matrices,  while  the  linear  vector  operator  evidently 
belongs  to  that  class  of  multiple  quantities,  it  seems  unreasonable  to 
refuse  to  the  one  those  notations  which  we  grant  to  the  other.  Thus, 
if  the  quaternionist  has  «*,  log  q,  sin  q,  cos  q,  why  should  not  the  vector 
analyst  have  6+  log$,  sin$,  oos$,  where  $  represents  a  linear  vector 
operator  ?  I  suppose  the  latter  are  at  least  as  useful  to  the  physicist 
I  mention  these  notations  first,  because  here  the  analogy  is  most 
evident  But  there  are  other  cases  far  more  important,  because  more 
elementary,  in  which  the  analogy  is  not  so  near  the  surface,  and 
therefore  the  difference  in  Hamilton's  treatment  of  the  two  kinds  of 
multiple  quantity  not  so  evident  We  have,  for  example,  the  tensor 
of  the  quaternion,  which  has  the  important  property  represented  by 
the  equation:  T(qr)=TqTr. 

There  is  a  scalar  quantity  related  to  the  linear  vector  operator, 
which  I  have  represented  by  the  notation  |$|  and  called  the 
determinant  of  $.  It  is  in  fact  the  determinant  of  the  matrix  by 
which  $  may  be  represented,  just  as  the  square  of  the  tensor  of  q 
(sometimes  called  the  norm  of  q)  is  the  determinant  of  the  matrix  by 
which  q  may  be  represented.  It  may  also  be  defined  as  the  product 
of  the  latent  roots  of  $,  just  as  the  square  of  the  tensor  of  q  might  be 
defined  as  the  product  of  the  latent  roots  of  q.  Again,  it  has  the 
property  represented  by  the  equation 

|*.*H*II*I 

which  corresponds  exactly  with  the  preceding  equation  with  both 
sides  squared. 
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There  is  another  scalar  quantity  connected  with  the  quaternion  and 
represented  by  the  notation  Sq.  It  has  the  important  property 
expressed  by  the  equation, 

S(qr8)=S(raq)=S(8qr)t 

and  so  for  products  of  any  number  of  quaternions,  in  which  the  cyclic 
order  remains  unchanged.  In  the  theory  of  the  linear  vector  operator 
there  is  an  important  quantity  which  I  have  represented  by  the 
notation  $g,  and  which  has  the  property  represented  by  the  equation 

(#.*.Q)B=(*.Q.$)ii=(n.*.*)b, 
where  the  number  of  the  factors  is  as  before  immaterial    $B  may  be 
defined  as  the  sum  of  the  latent  roots  of  <£,  just  as  2Sq  may  be 
defined  as  the  sum  of  the  latent  roots  of  q. 

The  analogy  of  these  notations  may  be  further  illustrated   by. 
comparing  the  equations 

T(^)=e8« 

and  fdtjsse*. 

I  do  not  see  why  it  is  not  as  reasonable  for  the  vector  analyst  to 
have  notations  like  |<£  |  and  $8,  as  for  the  quaternionist  to  have  the 
notations  Tq  and  Sq. 

This  is  of  course  an  argumenttvm  ad  quaternionisten.  I  do  not 
pretend  that  it  gives  the  reason  why  I  used  these  notations,  for  the 
identification  of  the  quaternion  with  a  matrix  was,  I  think,  unknown 
to  me  when  I  wrote  my  pamphlet.  The  real  justification  of  the 
notations  |  #  |  and  $B  is  that  they  express  functions  of  the  linear 
vector  operator  qud  quantity,  which  physicists  and  others  have 
continually  occasion  to  use.  And  this  justification  applies  to  other 
notations  which  may  not  have  their  analogues  in  quaternions.  Thus 
I  have  used  $x  to  express  a  vector  so.  important  in  the  theory  of  the 
linear  vector  operator,  that  it  can  hardly  be  neglected  in  any  treatment 
of  the  subject  It  is  described,  for  example,  in  treatises  as  different  as 
Thomson  and  Tait's  Natural  Philosophy  and  Kelland  and  Tait's 
Quaternions.  In  the  former  treatise  the  components  of  the  vector 
are,  of  course,  given  in  terms  of  the  elements  of  the  linear  vector 
operator,  which  is  in  accordance  with  the  method  of  the  treatise.  In 
the  latter  treatise  the  vector  is  expressed  by 
Vaa'+V^'+Vyy'. 

As  this  supposes  the  linear  vector  operator  to  be  given  not  by  a 
single  letter,  but  by  several  vectors,  it  must  be  regarded  as  entirely 
inadequate  by  any  one  who  wishes  to  treat  the  subject  in  the  spirit  of 
multiple  algebra,  i.e.  to  use  a  single  letter  to  represent  the  linear 
vector  operator. 

But  my  critic  does  not  like  the  notations  |  $  |,  $B,  $*.    His  ridicule, 
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indeed  reaches  high-water  mark  in  the  paragraphs  in  which  he  men- 
tions them.  Concerning  another  notation,  $£$  (denned  in  Nature,  vol 
xliii,  p.  513)  [this  vol.,  p.  160],  ho  exclaims,  "  Thus  burden  after  burden, 
in  the  form  of  new  notation,  is  added  apparently  for  the  sole  purpose 
of  exercising  the  faculty  of  memory."  He  would  vastly  prefer,  it 
would  appear,  to  write  with  Hamilton  rruf>'-1,  "  where  m  represents 
what  the  unit  volume  becomes  under  the  influence  of  the  linear 
operator."  But  this  notation  is  only  apparently  compact,  since  the 
m  requires  explanation.  Moreover,  if  a  strain  were  given  in  what 
Hamilton  calls  the  standard  trinomial  form,  to  write  out  the  formula 
for  the  operator  on  surfaces  in  that  standard  form  by  the  use  of  the 
expression  m^'-1  would  require,  it  seems  to  me,  ten  (if  not  fifty)  times 
the  effort  of  memory  and  of  ingenuity,  which  would  be  required  for 
the  same  purpose  with  the  use  of  £$£$. 

I  may  here  remark  that  Prof.  Tait's  letter  of  endorsement  of  Prof. 
Knott's  paper  affords  a  striking  illustration  of  the  convenience  and 
flexibility  of  a  notation  entirely  analogous  to  $£$,  viz.,  $:$.  He 
gives  the  form  S Wx  S<nr1  to  illustrate  the  advantage  of  quaternionic 
notations  in  point  of  brevity.  If  I  understand  his  notation,  this  is 
what  I  should  write  V<r:V<r.  (I  take  for  granted  that  the  suffixes 
indicate  that  V  applies  as  differential  operator  to  <r,  and  Vx  to  <rlf 
<r  and  <rx  being  really  identical  in  meaning,  as  also  V  and  Vr)  It 
will  be  observed  that  in  my  notation  one  dot  unites  in  multiplication 
the  two  V's,  and  the  other  the  two  o-'s,  and  that  I  am  able  to  leave 
each  V  where  it  naturally  belongs  as  differential  operator.  The 
quaternionist  cannot  do  this,  because  the  V  and  cr  cannot  be*  left 
together  without  uniting  to  form  a  quaternion,  which  is  not  at  all 
wanted.  Moreover,  I  can  write  $  for  V<r,  and  $:$  for  V<r:V<r.  The 
quaternionist  also  uses  a  #,  which  is  practically  identical  with  my  $ 
(viz.,  the  operator  which  expresses  the  relation  between  da-  and  dp), 
but  I  do  not  see  how  Prof.  Knott,  who  I  suppose  dislikes  $:$  as 
much  as  $£$,  would  express  SVV1  So*^  in  terms  of  this  <f>. 

It  is  characteristic  of  Prof.  Knott's  view  of  the  subject,  that  in 
translating  into  quaternionic  from  a  dyadic,  or  operator,  as  he  calls  it, 
he  adds  in  each  case  an  operand.  In  many  cases  it  would  be  difficult 
to  make  the  translation  without  this.  But  it  is  often  a  distinct 
advantage  to  be  able  to  give  the  operator  without  the  operand.  For 
example,  in  translating  into  quaternionic  my  dyadic  or  operator  $x/>, 
he  adds  an  operand,  and  exclaims,  "  The  old  thing ! "  Certainly,  when 
this  expression  is  applied  to  an  operand,  there  is  no  advantage  (and 
no  disadvantage)  in  my  notation  as  compared  with  the  quaternionic 
But  if  the  quaternionist  wished  to  express  what  I  would  write  in  the 
form  ($xp)'\  or  |*Xp|,  or  ($X/0b,  or  (*Xp)x,  be  would,  I  think 
find  the  operand  very  much  in  the  way. 


Digitized  by  VjOOQIC 


XL 


ON  DOUBLE  REFRACTION  AND  THE  DISPERSION  OP 
COLORS  IN  PERFECTLY  TRANSPARENT  MEDIA. 

[American  Journal  of  Science,  eer.  3,  vol.  xxm,  pp.  262-275,  April,  1882.] 

1.  In  calculating  the  velocity  of  a  system  of  plane  waves  of  homo- 
geneous light,  regarded  as  oscillating  electrical  fluxes,  in  transparent 
and  sensibly  homogeneous  bodies,  whether  singly  or  doubly  refracting, 
we  may  assume  that  such  a  body  is  a  very  fine-grained  structure,  so 
that  it  can  be  divided  into  parts  having  their  dimensions  very  small 
in  comparison  with  the  wave-length,  each  of  which  may  be  regarded 
as  entirely  similar  to  every  other,  while  in  the  interior  of  each  there 
are  wide  differences  in  electrical  as  in  other  physical  properties. 
Hence,  the  average  electrical  displacement  in  such  parts  of  the  body 
may  be  expressed  as  a  function  of  the  time  and  the  coordinates  of 
position  by  the  ordinary  equations  of  wave-motion,  while  the  real 
displacement  at  any  point  will  in  general  differ  greatly  from  that 
represented  by  such  equations. 

It  is  the  object  of  this  paper  to  investigate  the  velocity  of  light  in 
perfectly  transparent  media  which  have  not  the  property  of  circular 
polarization  in  a  manner  which  shall  take  account  of  this  difference 
between  the  real  displacements  and  those  represented  by  the  ordinary 
equations  of  wave-motion.  We  shall  find  that  this  difference  will 
account  for  the  dispersion  of  colors,  without  affecting  the  validity 
of  the  laws  of  Huyghens  and  Fresnel  for  double  refraction  with 
respect  to  light  of  any  one  color. 

In  this  investigation,  it  is  assumed  that  the  electrical  displacements 
are  soUnoidal,  or,  in  other  words,  that  they  are  such  as  not  to  produce 
any  change  in  electrical  density.  The  disturbance  in  the  medium  is 
treated  as  consisting  entirely  of  such  electrical  displacements  and 
fluxes,  and  not  complicated  by  any  distinctively  magnetic  phenomena. 
It  might  therefore  be  more  accurate  to  call  the  theory  (as  here 
developed)  electrical  rather  than  eUctrwnagnetic.  The  latter  term  is 
nevertheless  retained  in  accordance  with  general  usage,  and  with  that 
of  the  author  of  the  theory. 

Since  the  velocity  which  we  are  seeking  is  equal  to  the  wave-length 
divided  by  the  period  of  oscillation,  the  problem  reduces  to  finding 
the  ratio  of  these  quantities,  and  may  be  simplified  in  some  respects 
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by  supposing  that  we  have  to  do  with  a  system  of  stationary  waves. 
That  the  relation  of  the  wave-length  and  the  period  is  the  same  for 
stationary  as  for  progressive  waves  is  evident  from  the  consideration 
that  a  system  of  stationary  waves  may  be  formed  by  two  systems  of 
progressive  waves  having  opposite  directions. 

2.  Let  x,  y,  z  be  the  rectangular  coordinates  of  any  point  in  the 
medium,  which  with  the  system  of  waves  we  may  regard  as  inde- 
finitely extended,  and  let  £+f,  fl+ff,  f+f  be  the  components  of 
electrical  displacement  at  that  point  at  the  time  t ;  £  9,  f  being  the 
average  values  of  the  components  of  electrical  displacement  at  that 
time  in  a  wave-plane  passing  through  the  point  Then  £  9,  £,  £*,  tf,  £' 
are  perfectly  defined  quantities,  of  which  £  9,  f  are  connected  with 
x,  y,  z,  and  t  by  the  ordinary  equations  of  wave-motion,  while  each 
of  the  quantities  £*,  tf,  f  has  always  zero  for  its  average  value  in  any 
wave-plane.  We  may  call  £  9,  £  the  components  of  the  regular  part 
of  the  displacement,  and  £*,  9',  f  the  components  of  the  irregular 
part  of  the  displacement  In  like  manner,  the  differential  coefficients 
of  these  quantities  with  respect  to  the  time,  £  9,  f,  £,  if,  f,  may  be 
called  respectively  the  components  of  the  regular  part  of  the  flux, 
and  the  components  of  the  irregular  part  of  the  flux. 

Let  the  whole  space  be  divided  into  elements  of  volume  Dv,  very 
small  in  all  dimensions  in  comparison  with  a  wave-length,  but 
enclosing  portions  of  the  medium  which  may  be  treated  as  entirely 
similar  to  one  another,  and  therefore  not  infinitely  small  Thus  a 
crystal  may  be  divided  into  elementary  parallelepipeds,  all  the  vertices 
of  which  are  similarly  situated  with  respect  to  the  internal  structure 
of  the  crystal  Amorphous  solids  and  liquids  may  not  be  capable  of 
division  into  equally  small  portions  of  which  physical  similarity  can 
be  predicated  with  the  same  rigor.  Tet  we  may  suppose  them  capable 
of  a  division  substantially  satisfying  the  requirements. 

From  these  definitions  it  follows  that  at  any  given  instant  the 
average  value  of  each  of  the  quantities  f,  tf,  f  in  an  element  Dt;  is 
zero.  For  the  average  value  in  one  such  element  must  be  sensibly 
the  same  as  in  any  other  situated  on  the  same  wave-plane.  If  this 
average  were  not  zero,  the  average  for  the  wave-plane  would  not  be 
zero.  Moreover,  at  any  given  instant,  the  values  of  £  9,  f  may  be 
regarded  as  constant  throughout  any  element  Dv,  and  as  representing 
the  average  values  of  the  components  of  displacement  in  that  element. 
The  same  will  be  fame  of  the  quantities  ^,  9',  f  and  £  9,  £ 

3.  Since  we  have  excluded  the  case  of  media  which  have  the  pro- 
perty of  circular  polarization,  we  shall  not  impair  the  generality  of 
our  results  if  we  suppose  that  we  have  to  do  with  linearly  polarized 
light,  i.e.,  that  the  regular  part  of  the  displacement  is  everywhere 
parallel  to  the  same  fixed  line,  all  cases  not  already  excluded  being 
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reducible  to  this.  Then,  with  the  origin  of  coordinates  and  the  zero 
of  time  suitably  chosen,  the  regular  part  of  the  displacement  may  be 
represented  by  the  equations 

£=  o  cos  2xr  cos  2x- 
l  p 


^*j8cos 2xt-  cos  2x-  , 


(1) 


P 

f  =  y  008  2xt  COS  2x- 

where  I  denotes  the  wave-length,  p  the  period  of  vibration,  a,  £,  y 
the  maximum  amplitudes  of  the  displacements  £  9,  £  and  v,  the 
distance  of  the  point  considered  from  the  wave-plane  which  passes 
through  the  origin.  Since  u  is  a  linear  function  of  x,  y,  and  z,  we 
may  regard  these  equations  as  giving  the  values  of  £  9,  £,  for  a  given 
system  of  waves,  in  terms  of  x,  y,  z,  and  t. 

4.  The  components  of  the  irregular  displacement,  £,  rft  f ,  at  any 
given  point,  will  evidently  be  simple  harmonic  functions  of  the  time, 
having  the  same  period  as  the  regular  part  of  the  displacement.  That 
they  will  also  have  the  same  phase  is  not  quite  so  evident,  and  would 
not  be  the  case  in  a  medium  in  which  there  were  any  absorption  or 
dispersion  of  light  It  will  however  appear  from  the  following  con- 
siderations that  in  perfectly  transparent  media  the  irregular  oscil- 
lations are  synchronous  with  the  regular.  For  if  they  are  not 
synchronous,  we  may  resolve  the  irregular  oscillations  into  two  parts, 
of  which  one  shall  be  synchronous  with  the  regular  oscillations,  and 
the  other  shall  have  a  difference  of  phase  of  one-fourth  of  a  complete 
oscillation.  Now  if  the  medium  is  one  in  which  there  is  no  absorption 
or  dispersion  of  light,  we  may  assume  that  the  same  electrical  con- 
figurations may  also  be  passed  through  in  the  inverse  order,  which 
would  be  represented  analytically  by  writing  —  t  for  t  in  the  equations 
which  give  £  9,  f,  £,  if,  f ,  as  functions  of  x,  y,  z,  and  t.  But  this 
change  would  not  affect  the  regular  oscillations,  nor  the  synchronous 
part  of  the  irregular  oscillations,  which  depends  on  the  cosine  of  the 
time,  while  the  non-synchronous  part  of  the  irregular  oscillations, 
which  depends  on  the  sine  of  the  time,  would  simply  have  its 
direction  reversed.  Hence,  by  taking  first  one-half  the  sum,  and 
secondly  one-half  the  difference,  of  the  original  motion  and  that 
obtained  by  substitution  of  —  t  for  t,  we  may  separate  the  non- 
synchronous  part  of  the  irregular  oscillations  from  the  rest  of  the 
motion.  Therefore,  the  supposed  non-synchronous  part  of  the  irregular 
displacement,  if  capable  of  existence,  is  at  least  wholly  independent 
of  the  wave-motion  and  need  not  be  considered  by  us. 

We  may  go  farther  in  the  determination  of  the  quantities  f,  rf9  f . 
For  in  view  of  the  very  fine-grained  structure  of  the  medium,  it  will 
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easily  appear  that  the  manner  in  which  the  general  or  average  flux 
in  any  element  Df  (represented  by  £  9,  f )  distributes  itself  among 
the  molecules  and  intermolecular  spaces  must  be  entirely  determined 
by  the  amount  and  direction  of  that  flux  and  its  period  of  oscillation. 
Hence,  and  on  account  of  the  superposable  character  of  the  motions 
which  we  are  considering,  we  may  conclude  that  the  values  of  f ,  r[t  f 
at  any  given  point  in  the  medium  are  capable  of  expression  as  linear 
functions  of  £,  7,  £  in  a  manner  which  shall  be  independent  of  the 
time  and  of  the  orientation  of  the  wave-planes  and  the  distance  of 
a  nodal  plane  from  the  point  considered,  so  long  as  the  period  of 
oscillation  remains  the  same.  But  a  change  in  the  period  may 
presumably  affect  the  relation  between  £•',  #',  f '  and  £  j/,  f  to  a  certain 
extent.  And  the  relation  between  £*,  tj',  f  and  £  j/,  f  will  vary 
rapidly  as  we  pass  from  one  point  to  another  within  the  element  Dv. 

5.  In  the  motion  which  we  are  considering  there  occur  alternately 
instants  of  no  velocity  and  instants  of  no  displacement.  The  statical 
energy  of  the  medium  at  an  instant  of  no  velocity  must  be  equal  to 
its  kinetic  energy  at  an  instant  of  no  displacement  Let  us  examine 
each  of  these  quantities,  and  consider  the  equation  which  expresses 
their  equality. 

6.  Since  in  every  part  of  an  element  Dv  the  irregular  as  well  as  the 
regular  part  of  the  displacement  is  entirely  determined  (for  light  of 
a  given  period)  by  the  values  of  £  tj,  £  the  statical  energy  of  the 
element  must  be  a  quadratic  function  of  £  tj,  £  say 

(At*+Bff+C?+E?,£+Ftf+Q£?,)Dv, 

where  A,  B,  etc.  depend  only  on  the  nature  of  the  medium  and  the 
period  of  oscillation.    At  an  instant  of  no  velocity,  when 

sin27r-=0,    and    cosf27r-=l, 
P  P 

the  above  expression  will  reduce  by  equations  (1)  to 

(Aaf-fBi8«+CV-fEi8y-fFya-faa^)cos22irjDv. 

Since  the  average  value  of  coe*2xy  in  an  indefinitely  extended 
space  is  J,  we  have  for  the  statical  energy  in  a  unit  of  Volume 

S  =  KAaf+B£HC^+E0y+Fya+Ga)8).  (2) 

7.  The  kinetic  energy  of  the  whole  medium  is  represented  by  the 
double  volume-integral* 


vffi+M+?hdVldVt, 


*  The  fluxes  are  supposed  to  be  measured  by  the  electromagnetic)  system  of  units.  It 
is  to  be  observed  that  the  difference  of  opinion  which  has  prevailed  with  respect  to  the 
estimation  of  the  energy  of  electrical  currents  docs  not  extend  to  such  as  are  tolenoidal, 
which  may  be  regarded  as  composed  of  closed  circuits. 
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where  dvx>  dvt  are  two  infinitesimal  elements  of  volume,  (^+^X, 
(( +^)g  the  corresponding  components  of  flux,  r  the  distance  between 
the  elements,  and  2  denotes  a  summation  with  respect  to  the 
coordinate  axes.  Separating  the  integrations,  we  may  write  for 
the  same  quantity 

It  is  evident  that  the  integral  within  the  brackets  is  derived  from 
£+£  by  t°e  8ame  process  by  which  the  potential  of  any  mass  is 
derived  from  its  density.  If  we  use  the  symbol  Pot  to  express  this 
relation,  we  may  write  for  the  kinetic  energy 

The  operation  denoted  by  this  symbol  is  evidently  distributive, 
so  that  Pot^+^-Pot^+Pot^.  The  expression  for  the  kinetic 
energy  may  therefore  be  expanded  into 

iS/^Pot/dv-fiZ/^Pot^dv+iS/^Pot^dv+iZ/^Pot^dt;. 

But  ^,  and  therefore  Pot^,  has  in  every  wave-plane  the  average 
value  zero.  Also  £  and  therefore  Pot£  has  in  every  wave-plane  a 
constant  value.  Therefore  the  second  and  third  integrals  in  the 
above  expression  will  vanish,  leaving  for  the  kinetic  energy 

iS/^Pot^dv+JZ/^Pot^dv,  (3) 

which  is  to  be  calculated  for  a  time  of  no  displacement,  when 

f.^.o.1,*     ^±2^cos2x«     t-±*Xm»r*.    (4) 

The  form  of  the  expression  (3)  indicates  that  the  kinetic  energy 
consists  of  two  parts,  one  of  which  is  determined  by  the  regular  part 
of  the  flux,  and  the  other  by  the  irregular  part  of  the  flux. 

8.  The  value  of  Pot  £  may  be  easily  found  by  integration,  but 
perhaps  more  readily  by  Poisson's  well-known  theorem,  that  if  q  is 
any  function  of  position  in  space  (as  the  density  of  a  certain  mass), 

cPPotg  ,  <P?otq  ,  d*Pottf        A  /rx 

where  the  direction  of  the  coordinate  axes  is  immaterial,  provided 
that  they  are  rectangular.  In  applying  this  to  Pot  £ ,  we  may  place 
two  of  the  axes  in  a  wave-plane.    This  will  give 

*£*--«■*  <«> 

In  a  nodal  plane,  Pot£=0,  since  £  has  equal  positive  and  negative 
values  in  elements  of  volume  symmetrically  distributed  with  respect 
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to  any  point  in  such  a  plane.  In  a  wave-crest  (or  plane  in  which 
£  has  a  maximum  value),  Pot£  will  also  have  a  maximum  value, 
which  we  may  call  E.  For  intermediate  points  we  may  determine 
its  value  from  the  consideration  that  the  total  disturbance  may  be 
resolved  into  two  systems  of  waves,  one  having  a  wave-crest,  and  the 
other  a  nodal  plane  passing  through  the  point  for  which  the  potential 
is  sought.  The  maximum  amplitudes  of  these  component  systems 
will    be    to    the    maximum    amplitude    of  the  original  system  as 

coe2xy  and  sm2xy  to  unity.     But  the  second  of  the  component 

systems  will  contribute  nothing  to  the  value  of  the  potential.  We 
thus  obtain 

Potf=Kcos2^, 

<PPot£        4nc*v        0   u         4?r*      .* 
^_rCS3._Kcos2^=— rPot/. 

Comparing  this  with  equation  (6),  we  have 

-^Potf«-4^, 

Pot^^.  (7) 

Hence,  and  by  equations  (4), 

i2/^Pot^dv=^2/^dt;=^(a*+i88+y8)/coe«27r^^ 

The  kinetic  energy  of  the  regular  part  of  the  flux  is  therefore,  for 
each  unit  of  volume, 

T  =  ^(««+^+y«).  (8) 

9.  With  respect  to  the  kinetic  energy  of  the  irregular  part  of  the 
flux,  it  is  to  be  observed  that,  since  £*,  fj\  f  have  their  average  values 
zero  in  spaces  which  are  very  small  in  comparison  with  a  wave-length, 
the  integrations  implied  in  the  notations  Pot  ^,  Pot  if,  Pot  ?  may  be 
confined  to  a  sphere  of  a  radius  which  is  small  in  comparison  with  a 
wave-length.  Since  within  such  a  sphere  gt  rf9  g  are  sensibly  deter- 
mined by  the  values  of  £  ij,  £  at  the  center  of  the  sphere,  which  is  the 
point  for  which  the  value  of  the  potentials  are  sought,  Pot  £*,  Pot  # ', 
Pot^  must  be  functions — evidently  linear  functions— of  &$,(;  and 
(•'Pot^,  jy'Potij',  j'Pot£'  must  be  quadratic  functions  of  the  same 
quantities.  But  these  functions  will  vary  with  the  position  of  the 
point  considered  with  reference  to  the  adjacent  molecules. 

Now  the  expression  for  the  kinetic  energy  of  the  irregular  part 
of  the  flux, 

iS/fPotfdt;, 
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indicates  that  we  may  regard  the  infinitesimal  element  dv  as  having 
the  energy  (due  to  this  part  of  the  flux) 

i2f  Votfdv. 
Let  us  consider  the  energy  due  to  the  irregular  flux  which  will  belong 
to  the  above  defined  element  Dv,  which  is  not  infinitely  small,  but 
which  has  the  advantage  of  being  one  of  physically  similar  elements 
which  make  up  the  whole  medium.  The  energy  of  this  element  is 
found  by  adding  the  energies  of  all  the  infinitesimal  elements  of 
which  it  is  composed.  Since  these  are  quadratic  functions  of  the 
quantities  £  rj,  £,  which  are  sensibly  constant  throughout  the  element 
Dv,  the  sum  will  be  a  quadratic  function  of  £  rj}  £  say 

(A/^+BV+C'fHE'^+F^-fG'^)rh;, 

which  will  therefore  represent  the  energy  of  the  element  Dv  due  to 
the  irregular  flux.  The  coefficients  A',  B',  etc,  are  determined  by  the 
nature  of  the  medium  and  the  period  of  oscillation.  They  will  be 
constant  throughout  the  medium,  since  one  element  Dv  does  not  differ 
from  another. 
This  expression  reduces  by  equations  (4)  to 

^(A'aHB^+a^+^y+Fya+G'a^cos^^Dt;. 

The  kinetic  energy  of  the  irregular  flux  in  a  unit  of  volume  is  there- 
fore o_2 

T/=^-(A'a8-fB/)88-fO/y«+E/i8y-|-Fya-fG/ai8).  (9) 

10.  Equating  the  statical  and  kinetic  energies,  we  have 
^A^  +  B^+C^+EjSy+Fya  +  Ga^) 

=  §\a2+^+y2)+^(A/aHB'^+CV-fE'i8y+Fya+G/a/3).(10) 

The  velocity  (V)  of  the  corresponding  system  of  progressive  waves  is 
given  by  the  equation 

to-P-  *    AaHBff^CY+Efty+Fytt+Qafi 
V  -j?""2x  a*+i8*+y* 

2x  AV+B'ff+Oy+E'fo+Fya+Q'aft  ,11X 

"fF J+P+i* '  (U) 

If  we  set 

°=^A-]JA''   6=^B-^B-  etc-         <12> 

and  />*=a8+/8*+y* 

the  equation  reduces  to 

m    acfi+bp+c-f+e&y+fya+gaP 

V     —  0  • 


P> 


(13) 
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For  a  given  medium  and  light  of  a  given  period,  the  coefficients  a,  6, 
etc.,  are  constant. 

This  relation  between  the  velocity  of  the  waves  and  the  direction  of 
oscillation  is  capable  of  a  very  simple  geometrical  expression.  Let  r 
be  the  radius  vector  of  the  ellipsoid 

aa?+by*+cz*+eyz+fzx+gxy  =  l.  (14) 

Then  1  =^+ty*+cz%+eyz+fzx+9xy 

If  this  radius  is  drawn  parallel  to  the  electrical  oscillations,  we  shall 

r    p       r     p        r    p 

and  V*=i.  (15) 

That  is,  the  wave-velocity  for  any  particular  direction  of  oscillation 
is  represented  in  the  ellipsoid  by  the  reciprocal  of  the  radius  vector 
which  is  parallel  to  that  direction. 

11.  This  relation  between  the  wave-length,  the  period,  and  the 
direction  of  vibration,  must  hold  true  not  only  of  such  vibrations  as 
actually  occur,  but  also  of  such  as  we  may  imagine  to  occur  under 
the  influence  of  constraints  determining  the  direction  of  vibration  in 
the  wave-plane.  The  directions  of  the  natural  or  unconstrained 
vibrations  in  any  wave-plane  may  be  determined  by  the  general 
mechanical  principle  that  if  the  type  of  a  natural  vibration  is  infini- 
tesimally  altered  by  the  application  of  a  constraint,  the  value  of  the 
period  will  be  stationary.*  Hence,  in  a  system  of  stationary  waves 
such  as  we  have  been  considering,  if  the  direction  of  an  unconstrained 
vibration  is  infinitesimally  varied  in  its  wave-plane  by  a  constraint 
while  the  wave-length  remains  constant,  the  period  will  be  stationary. k 
Therefore,  if  the  direction  of  the  unconstrained  vibration  is  infinitesi- 
mally varied  by  constraint,  and  the  period  remains  rigorously  constant, 
the  wave-length  will  be  stationary.  Hence,  if  we  make  a  central 
section  of  the  above  described  ellipsoid  parallel  to  any  wave-plane, 
the  directions  of  natural  vibration  for  that  wave-plane  will  be  parallel 
to  the  radii  vectores  of  stationary  value  in  that  section,  viz.,  to  the 
axes  of  the  ellipse,  when  the  section  is  elliptical,  or  to  all  radii,  when 
the  section  is  circular. 

12.  For  light  of  a  single  period,  our  hypothesis  has  led  to  a 
perfectly  definite  result,  our  equations  expressing  the  fundamental 
laws  of  double  refraction  as  enunciated  by  FresneL  But  if  we  ask 
how  the  velocity  of  light  varies  with  the  period,  that  is,  if  we  seek 

*8ee  Rayleigh't  Theory  qf  Sound,  vol  i,  p.  84. 
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to  derive  from  the  same  equations  the  laws  of  the  dispersion  of  colors, 
we  shall  not  be  able  to  obtain  an  equally  definite  result,  since  the 
quantities  A,  B,  etc,  and  A',  B',  etc.,  are  unknown  functions  of  the 
period.  If,  however,  we  make  the  assumption,  which  is  hardly  likely 
to  be  strictly  accurate,  but  which  may  quite  conceivably  be  not  far 
removed  from  the  truth,  that  the  manner  in  which  the  general  or 
average  flux  in  any  small  part  of  the  medium  distributes  itself  among 
the  molecules  and  intermolecular  spaces  is  independent  of  the  period, 
the  quantities  A,  B,  etc.,  and  A",  B',  etc,  will  be  constant,  and  we 
obtain  a  very  simple  relation  between  V  and  p,  which  appears  to  agree 
tolerably  well  with  the  results  of  experiment. 


If  we  set       H-A*+BP+Cy+fr+Fr«+Gaft, 


P 


'"  P* 

our  general  equation  (11)  becomes 


(16) 
(17) 


^-K-IT'  (18) 

where  H  and  H'  will  be  constant  for  any  given  direction  of  oscillation, 
when  A,  B,  etc,  and  A',  B',  etc,  are  constant  If  we  wish  to  introduce 
into  the  equation  the  absolute  index  of  refraction  (n)  and  the  wave- 
length in  vacuo  (X)  in  place  of  V  and  p,  we  may  divide  both  sides  of 
the  equation  by  the  square  of  the  constant  (h)  representing  the 
velocity  of  light  in  vacuo.    Then,  since 

£»-,  and  *p=X, 

our  equation  reduces  to 

1  _   H      2*H'  (m 

5P-5rP" xr  (19) 

It  is  well  known  that  the  relation  between  n  and  X  may  be  tolerably 
well  but  by  no  means  perfectly  represented  by  an  equation  of  this 
form. 

13.  If  we  now  give  up  the  presumably  inaccurate  supposition  that 
A,  B,  etc,  and  A',  B',  etc,  are  constant,  equation  (19)  will  still  subsist, 
but  H  and  H'  will  not  be  constant  for  a  given  direction  of  oscillation, 
but  will  be  functions  of  p,  or,  what  amounts  to  the  same,  of  X. 
Although  we  cannot  therefore  use  the  equation  to  derive  a  priori  the 
relation  between  n  and  X,  we  may  use  it  to  derive  the  values  of  H 
and  H'  from  the  empirically  determined  relation  between  n  and  X. 
To  do  this,  we  must  make  use  again  of  the  general  principle  that  an 
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infinitesimal  variation  in  the  type  of  a  vibration,  due  to  a  constraint, 
will  not  affect  the  period.  If  we  first  consider  a  certain  system  of 
stationary  waves,  then  a  system  in  which  the  wave-length  is  greater 
by  an  infinitesimal  dl  (the  direction  of  oscillation  remaining  the  same), 
the  period  will  be  increased  by  an  infinitesimal  dp,  and  the  manner  in 
which  the  flux  distributes  itself  among  the  molecules  and  intermole- 
cular  spaces  will  presumably  be  infinitesimally  changed.  But  if  we 
suppose  that  in  the  second  system  of  waves  there  is  applied  a  con- 
straint compelling  the  flux  to  distribute  itself  in  the  same  way  among 
the  molecules  and  intermolecular  spaces  as  in  the  first  system  (so  that 
£'>  9  >  f  8Q*U  be  the  same  functions  as  before  of  £  9,  £ — a  supposition 
perfectly  compatible  with  the  fact  that  the  values  of  £  $,  f  are 
changed),  this  constraint,  according  to  the  principle  cited,  will  not 
affect  the  period  of  oscillation.  Our  equations  will  apply  to  such  a 
constrained  type  of  oscillation,  and  A,  B,  etc.,  and  A',  F,  etc.,  and 
therefore  H  and  H',  will  have  the  same  values  in  the  last  described 
system  of  waves  as  in  the  first  system,  although  the  wave-length  and 
the  period  have  been  varied.  Therefore,  in  differentiating  equation 
(18),  which  is  essentially  an  equation  between  I  and  pt  or  its  equivalent 
(19),  we  may  treat  H  and  H'  as  constant    This  gives 

2_dn    lirW 
"n8^*    X8  * 

We  thus  obtain  the  values  of  H'  and  H 

H--W3v     H=^"""^8_3x-  (20) 

By  determining  the  values  of  H  and  H'  for  different  directions  of 
oscillation,  we  may  determine  the  values  of  A,  B,  etc.,  and  A',  B',  etc 

By  means  of  these  equations,  the  ratios  of  the  statical  energy  (S), 
the  kinetic  energy  due  to  the  regular  part  of  the  flux  (T),  and  the 
kinetic  energy  due  to  the  irregular  part  of  the  flux  (T),  are  easily 
obtained  in  a  form  which  admits  of  experimental  determination. 
Equations  (8)  and  (9)  give 

Therefore,  by  (20), 


T    2*-H,m2xH,n«^     X«&»  =     dlogn 
T~HF~         A1"-     ~«<Z\=""dlogX' 

S    T+T    ,  T    d\og\-dlogn     dlogl 
T  =  ~? 1  +  T"       VlogX        "ZlofY 


(21) 
(22) 


i--S5r  <»> 
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Since  S,  T,  and  T  are  essentially  positive  quantities.,  their  ratios 
must  be  positive.  Equation  (21)  therefore  requires  that  the  index 
of  refraction  shall  increase  as  the  period  or  wave-length  in  vacuo 
diminishes.  Experiment  has  shown  no  exceptions  to  this  rule,  except 
such  as  are  manifestly  attributable  to  the  absorption  of  light. 

14.  It  remains  to  consider  the  relations  between  the  optical  pro- 
perties of  a  medium  and  the  planes  or  axes  of  symmetry  which  it 
may  possess.  If  we  consider  the  statical  energy  per  unit  of  volume 
(S)  and  the  period  as  constant,  we  may  regard  equation  (2)  as  the 
equation  of  an  ellipsoid,  the  radii  vectores  of  which  represent  in 
direction  and  magnitude  the  amplitudes  of  systems  of  waves  having 
the  same  statical  energy.  In  like  manner,  if  we  consider  the  kinetic 
energy  of  the  irregular  part  of  the  flux  per  unit  of  volume  (T)  and 
the  period  as  constant,  we  may  regard  equation  (9)  as  the  equation 
of  an  ellipsoid,  the  radii  vectores  of  which  represent  in  direction  and 
magnitude  the  amplitudes  of  systems  of  waves  having  the  same 
kinetic  energy  due  to  the  irregular  part  of  the  flux.  These  ellipsoids, 
which  we  may  distinguish  as  the  ellipsoids  (A,  B,  eta)  and  (A',  B',  etc.), 
as  well  as  the  ellipsoid  before  described,  which  we  may  call  the 
ellipsoid  (a,  &,  etc.),  must  be  independent  in  their  form  and  their 
orientation  of  the  directions  of  the  axes  of  coordinates,  being  deter- 
mined entirely  by  the  nature  of  the  medium  and  the  period  of 
oscillation.  They  must  therefore  possess  the  same  kind  of  symmetry 
as  the  internal  structure  of  the  medium. 

If  the  medium  is  symmetrical  about  a  certain  axis,  each  ellipsoid 
must  have  an  axis  parallel  to  that.  If  the  medium  is  symmetrical 
with  respect  to  a  certain  plane,  each  ellipsoid  must  have  an  axis  at 
right  angles  to  that  plane.  If  the  medium  after  a  revolution  of  less 
than  180°  about  a  certain  axis  is  then  equivalent  to  the  medium  in 
its  first  position,  or  symmetrical  with  it  with  respect  to  a  plane  at 
right  angles  to  that  axis,  each  ellipsoid  must  have  an  axis  of  revolution 
parallel  to  that  axis.  These  relations  must  be  the  same  for  light  of 
all  colors,  and  also  for  all  temperatures  of  the  medium. 

15.  From  these  principles  we  may  infer  the  optical  characteristics 
of  the  different  crystallographic  systems. 

In  crystals  of  the  isometric  system,  as  in  amorphous  bodies,  the 
three  ellipsoids  reduce  to  spheres.  Such  media  are  optically  isotropic 
at  least  so  far  as  any  properties  are  concerned  which  come  within  the 
scope  of  this  paper. 

In  crystals  of  the  tetragonal  or  hexagonal  systems,  the  three 
ellipsoids  will  have  axes  of  rotation  parallel  to  the  principal  crystal- 
lographic axis.  Since  the  ellipsoid  (a,  6,  etc.)  has  but  one  circular 
section,  there  will  be  but  one  optic  axis,  which  will  have  a  fixed 
direction. 
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In  crystals  of  the  orthorhombic  system,  the  three  ellipsoids  will 
have  their  axes  parallel  to  the  rectangular  crystallographic  axes.  If 
we  take  these  directions  for  the  axes  of  coordinates,  E,  F,  O,  E',  I",  G', 
e,f,  g  will  vanish  and  equation  (13)  will  reduce  to 

P* 
If  the  coordinate  axes  are  so  placed  that 

a  >  b  >  c, 

the  optic  axes  will  lie  in  the  X-Z  plane,  making  equal  angles  <f>  with 
the  axis  of  Z,  which  may  be  determined  by  the  equation 

1811  *-^~j*(B-C)--4ir1(B'-Cr)" 

To  get  a  rough  idea  of  the  manner  in  which  <f>  varies  with  the  period, 
we  may  regard  A,  B,  C,  A',  B',  C  as  constant  in  this  equation. 

But  since  the  lengths  of  the  axes  of  the  ellipsoid  (a,  6,  etc.)  vary 
with  the  period,  it  may  easily  happen  that  the  order  of  the  axes 
with  respect  to  magnitude  is  not  the  same  for  all  colors.  In  that 
case,  the  optic  axes  for  certain  colors  will  lie  in  one  of  the  principal 
planes,  and  for  other  colors  in  another.  For  the  color  at  which  the 
change  takes  place,  the  two  optic  axes  will  coincide.    The  differential 

coefficient  -J&  becomes  infinitely  great  as  the  optic  axes  approach 

coincidence. 

In  crystals  of  the  monoclinic  system,  each  of  the  three  ellipsoids 
will  have  an  axis  perpendicular  to  the  plane  of  symmetry.  We  may 
choose  this  direction  for  the  axis  of  X.  Then  F,  Q,  F,  G',  /,  g,  will 
vanish  and  equation  (13)  will  reduce  to 

P* 
The  angle  0  made  by  one  of  the  axes  of  the  ellipsoid  (a,  6,  etc)  in 
the  plane  of  symmetry  with  the  axis  of  T  and  measured  toward  the 
axis  of  Z,  is  determined  by  the  equation 

wn^-c-&~^C-B)-4^(Cr--B7 
To  get  a  rough  idea  of  the  dispersion  of  the  axes  of  the  ellipsoid 
(a,  6,  etc.)  in  the  plane  of  symmetry,  we  may  regard  B,  C,  E,  B',  C,  E', 
as  constant  in  this  equation,  and  suppose  the  axis  of  Y  so  placed  as 
to  make  E  vanish. 

It  is  evident  that  in  this  system  the  plane  of  the  optic  axes  will  be 
fixed,  or  will  rotate  about  one  of  the  lines  which  bisect  the  angles 
made  by  the  optic  axes,  according  as  the  mean  axis  of  the  ellipsoid 
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(a,  bt  etc.)  is  perpendicular  to  the  plane  of  symmetry  or  lies  in  that 
plane.  In  the  first  case  the  dispersion  of  the  two  optic  axes  will  be 
unequal  The  same  crystal,  however,  with  light  of  different  colors, 
or  at  different  temperatures,  may  afford  an  example  of  each  case. 

In  crystals  of  the  triclinic  system,  since  the  ellipsoids  (A,  B,  etc) 
and  (A',  B',  eta)  are  determined  by  considerations  of  a  different 
nature,  and  there  are  no  relations  of  symmetry  to  cause  a  coincidence 
in  the  directions  of  their  axes,  there  will  not  in  general  be  any  such 
coincidence.  Therefore  the  three  axes  of  the  ellipsoid  (a,  6,  etc),  that 
is,  the  two  lines  which  bisect  the  angles  of  the  optic  axes  and  their 
common  normal,  will  vary  in  position  with  the  color  of  the  light. 

16.  It  appears  from  this  foregoing  discussion  that  by  the  electro- 
magnetic theory  of  light  we  may  not  only  account  for  the  dispersion 
of  colors  (including  the  dispersion  of  the  lines  which  bisect  the  angles 
of  the  optic  axes  m  doubly  refracting  media),  but  may  also  obtain 
Fresnel's  laws  of  double  refraction  for  every  kind  of  homogeneous 
light  without  neglect  of  the  quantities  which  determine  the  dispersion 
of  colors. 

But  a  closer  approximation  than  that  of  this  paper  will  be  neces- 
sary to  explain  the  phenomena  of  circularly  polarizing  media,  which 
depend  on  very  minute  differences  of  wave-velocity,  represented 
perhaps  by  a  few  units  in  the  sixth  significant  figure  of  the  index 
of  refraction.  That  the  degree  of  approximation  which  will  give  the 
laws  of  circular  and  elliptic  polarization  will  not  add  any  terms  to 
the  equations  of  this  paper,  except  such  as  vanish  for  media  which 
do  not  exhibit  this  phenomenon,  will  be  shown  in  another  number 
of  this  Journal. 
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ON  DOUBLE  REFRACTION  IN  PERFECTLY  TRANSPARENT 
MEDIA  WHICH  EXHIBIT  THE  PHENOMENA  OF  CIR- 
CULAR POLARIZATION. 

[American  Journal  of  Science,  ser.  3,  voL  xxm,  pp.  460-476,  Jane,  1882.] 

1.  In  the  April  number  of  this  Journal,*  the  velocity  of  propagation 
of  a  system  of  plane  waves  ol  light,  regarded  as  oscillating  electrical 
fluxes,  was  discussed  with  such  a  degree  of  approximation  as  would 
account  for  the  dispersion  of  colors  and  give  Fresnel's  laws  of  double 
refraction.  It  is  the  object  of  this  paper  to  supplement  that  discussion 
by  carrying  the  approximation  so  much  further  as  is  necessary  in 
order  to  embrace  the  phenomena  of  circularly  polarizing  media. 

2.  If  we  imagine  all  the  velocities  in  any  progressive  system  of 
plane  waves  to  be  reversed  at  a  given  instant  without  affecting  the 
displacements,  and  the  system  of  wave-motion  thus  obtained  to  be 
superposed  upon  the  original  system,  we  obtain  a  system  of  stationary 
waves  having  the  same  wave-length  and  period  of  oscillation  as  the 
original  progressive  system.  If  we  then  reduce  the  magnitude  of 
the  displacements  in  the  uniform  ratio  of  two  to  one,  they  will  be 
identical,  at  an  instant  of  maximum  displacement,  with  those  of  the 
original  system  at  the  same  instant. 

Following  the  same  method  as  in  the  paper  cited,  let  us  especially 
consider  the  system  of  stationary  waves,  and  divide  the  whole  dis- 
placement into  the  regular  part,  represented  by  £,  9,  £  and  the 
irregular  part,  represented  by  ft  if,  f ,  in  accordance  with  the 
definitions  of  §  2  of  that  paper. 

3.  The  regular  part  of  the  displacement  is  subject  to  the  equations 
of  wave-motion,  which  may  be  written  (in  the  most  general  case 
of  plane  stationary  waves) 


,  cos  2x  y + 04  sin  2x  y  j  cos  2x : 


*-(*-       <      -  up 

*-(&  cos  2xy+&mn2xy)cos2x|, 

f-^yj  008  2x^+7,  sin  2xjj  cos  2x-, 


(1) 


y 


♦  See  page  182  of  thia  yolame. 
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where  I  denotes  the  wave-length,  p  the  period  of  oscillation,  v,  the 
distance  of  the  point  considered  from  the  wave-plane  passing  through 
the  origin,  alt  &,  yx  the  amplitudes  of  the  displacements  £  9,  £  in 
the  wave-plane  passing  through  the  origin,  and  Og,  #2,  y2  their 
amplitudes  in  a  wave-plane  one-quarter  of  a  wave-length  distant 
and  on  the  side  toward  which  it  increases.  If  we  also  write  L,  M,  N 
for  the  direction-cosines  of  the  wave-normal  drawn  in  the  direction 
in  which  u  increases,  we  shall  have  the  following  necessary  relations : 

U+M»+N»-l,  (2) 

u=Lr.+My+Ns,  (3) 

Lo.+Mft+Ny^O,    La.+Mft+Ny^O.  (4) 

4  That  the  vrregular  part  of  the  displacement  (£*,  9',  f)  at  any 
given  point  is  a  simple  harmonic  function  of  the  time,  having  the 
same  period  and  phase  as  the  regular  part  of  the  displacement  (£  9,  f ), 
may  be  proved  by  the  single  principle  of  superposition  of  motions, 
and  is  therefore  to  be  regarded  as  exact  in  a  discussion  of  this  kind. 
But  the  further  conclusion  of  the  preceding  paper  (§  4),  "that  the 
values  of  £,  rft  f  at  any  given  point  in  the  medium  are  capable  of 
expression  as  linear  functions  of  £  q,  f  in  a  manner  which  shall  be 
independent  of  the  time  and  of  the  orientation  of  the  wave-planes  and 
the  distance  of  a  nodal  plane  from  the  point  considered,  so  long  as  the 
period  of  oscillation  remains  the  same,"  is  evidently  only  approxima- 
tive, although  a  very  close  approximation.  A  very  much  closer 
approximation  may  be  obtained,  if  we  regard  £ ,  rft  f ,  at  any  given 
point  of  the  medium  and  for  light  of  a  given  period,  as  linear  functions 
of  £  9,  f  and  the  nine  differential  coefficients 


l>  £.  $-  £. ete- 

z     dx'    dx     dyf 


We  shall  write  £  tj,  f  and  diff.  coeff.  to  denote  these  twelve  quantities. 
From  this  it  follows  immediately  that  with  the  same  degree  of 
approximation  £,  #',  £'  may  be  regarded,  for  a  given  point  of  the 
medium  and  light  of  a  given  period,  as  linear  functions  of  £  rjy  £  and 
the  differential  coefficients  of  £  #,  f  with  respect  to  the  coordinates. 
For  these  twelve  quantities  we  shall  write  £  tf,  £  and  diff.  coeff. 

5.  Let  us  now  proceed  to  equate  the  statical  energy  of  the  medium 
at  an  instant  of  no  velocity  with  its  kinetic  energy  at  an  instant  of 
no  displacement  It  will  be  convenient  to  estimate  each  of  these 
quantities  for  a  unit  of  volume. 

6.  The  statical  energy  of  an  infinitesimal  element  of  volume  may  be 
represented  by  a  dv,  where  <r  is  a  quadratic  function  of  the  components 
of  displacement  £+£,  tf+tf'>  £+T*  Since  for  that  element  of  volume 
£,  tj\  f '  may  be  regarded  as  linear  functions  of  £  rj,  f  and  diff.  coeff., 
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we  may  regard  <r  as  a  quadratic  function  of  £,  fj,  f  and  diff.  coeff.,  or 

as  a  linear  function  of  the  seventy-eight  squares  and  products  of  these 

quantities.    But  the  seventy-eight  coefficients  by  which  this  function 

is  expressed  will  vary  with  the  position  of  the  element  of  volume  with 

respect  to  the  surrounding  molecules. 

In  estimating  the  statical  energy  for  any  considerable  space  by  the 

integral  ~    , 

^  f<rdv, 

it  will  be  allowable  to  substitute  for  the  seventy-eight  coefficients 

contained  implicitly  in  <r  their  average  values  throughout  the  medium. 

That  is,  if  we  write  8  for  a  quadratic  function  of  £  9,  f,  cmd  diff.  coeff. 

in  which  the  seventy-eight  coefficients  are  the  space-averages  of  those 

in  <r,  the  statical  energy  of  any  considerable  space  may  be  estimated 

by  the  integral  r   , 

*  ^  fsdv. 

(This  will  appear  most  distinctly  if  we  suppose  the  integration  to 
be  first  effected  for  a  thin  slice  of  the  medium  bounded  by  two 
wave-planes.)  The  seventy-eight  coefficients  of  this  function  s  are 
determined  solely  by  the  nature  of  the  medium  and  the  period  of 
oscillation. 

We  may  divide  8  into  three  parts,  of  which  the  first  (*,)  contains  the 
squares  and  products  of  £  fl,  f  ,  the  second  (a„)  contains  the  products  of 
€>  9>  £  with  the  differential  coefficients,  and  the  third  («„,)  contains  the 
squares  and  products  of  the  differential  coefficients.  It  is  evident  that 
the  average  statical  energy  of  the  whole  medium  per  unit  of  volume 
is  the  space-average  of  8,  and  that  it  will  consist  of  three  parts,  which 
are  the  space-averages  of  a, ,  8„ ,  and  *,„ ,  respectively.  These  parts  we 
may  call  S,,  S„,  and  S,„.  Only  the  first  of  these  was  considered  in 
the  preceding  paper. 

Now  the  considerations  which  justify  us  in  neglecting,  for  an 
approximate  estimate,  the  terms  of  8  which  contain  the  differential 
coefficients  of  £,  9,  f  with  respect  to  the  coordinates,  will  apply  with 
especial  force  to  the  terms  which  contain  the  squares  and  products  of 
these  differential  coefficients.  Therefore,  to  carry  the  approximation 
one  step  beyond  that  of  the  preceding  paper,  it  will  only  be  necessary 
to  take  account  of  *,  and  8„  and  of  S,  and  S„. 

7.  We  may  set 

a^A^+B^-fC^+^-fFff-l-Gft,  (5) 

where,  for  a  given  medium  and  light  of  a  given  period,  A,  B,  C,  E,  F, 
O  are  constant 
Since  the  average  values  of 

sui82x7,    008*2x7,    sin 2x7- cos 2xy 
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are  respectively  J,  J,  and  0,  and  since  at  the  time  to  be  considered 

coef2x--l, 
V 

it  will  appear  from  inspection  of  equations  (1)  that 

S/-J(Aal1+BAf+0yi,+Bftyk+Pyka1+OalA) 

+  i(Aa22+Bi8at-fCy22+Ei8fy,+Fyfaf+GaA).  (6) 

This  is  the  first  part  of  the  statical  energy  of  the  whole  medium 
per  unit  of  volume. 

8.  The  second  part  of  the  statical  energy  of  the  whole  medium 
per  unit  of  volume  (S„)  is  the  space-average  of  «„,  which  is  a  linear 
function  of  the  twenty-seven  products  of  £,  9,  £  with  their  differential 
coefficients  with  respect  to  the  coordinates.    Now  since 


€-m  *%-m  ~. 


the  space-average  of  such  products  will  be  zero,  and  they  will 
contribute  nothing  to  the  value  of  S„.  There  will  be  nine  of  these 
products,  in  which  the  same  component  of  displacement  appears 
twice.  The  remaining  eighteen  products  may  be  divided  into  pairs 
according  to  the  letters  which  they  contain,  as 

•2  »d  £ 

A  linear  function  of  the  eighteen  products  may  also  be  regarded 
as  a  linear  function  of  the  sums  and  differences  of  the  products  in 
such  pairs.    But  since 

the  terms  of  «„  containing  such  sums  will  contribute  nothing  to  the 
value  of  S„.    We  have  left  a  linear  function  of  the  nine  differences 

d£     fid**!     *&€    +dt      A^n       d(       . 

»aHas«  w-t&  *ar'a£'  eta 

(the  unwritten  expressions  being  obtained  by  substituting  in  the 
denominators  dy  and  dz  for  dx),  which  constitutes  the  part  of  *„ 
that  we  have  to  consider.  S„  is  therefore  a  linear  function  of  the 
space-averages  of  these  nine  quantities.    But  by  (3) 


'l-^-K'l-^' 


and  the  space-average  of  this,  at  a  moment  of  maximum  displacement, 

—(Piyt-YiPih 
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By  such  redactions  it  appears  that  IS„  is  a  linear  function  of  the 
nine  products  of  L,  M,  N  with 

PiYt-viP*  yia8-aiy«>  «i&-Aoi- 

Now  if  we  set 

e-L08iyt-yli81)+M(y1ai-a1yf)-fN(ali81-/31a2),  (7) 

we  have  by  (4)  and  (2) 

^-Ayt-Vift,  MG^y^-o^,  Ne-aA-Aor  (8) 

Therefore  18„  is  a  linear  function  of  the  nine  products  of  L,  M,  N 
with  L6,  M6,  N6.  That  is,  18„  is  the  product  of  6  and  a  quadratic 
function  of  L,  M  and  N.     We  may  therefore  write 

S//  =  fe-f[L(/81yt-y^2)+M(y1a1-a1y8)+N(a1/3a-/31a2)],      (9) 

where  $  is  a  quadratic  function  of  L,  M  and  N,  dependent,  however, 
on  the  nature  of  the  medium  and  the  period  of  oscillation. 

9.  It  will  be  useful  to  consider  more  closely  the  geometrical 
significance  of  the  quantity  0.  For  this  purpose  it  will  be  convenient 
to  have  a  definite  understanding  with  respect  to  the  relative  position 
of  the  coordinate  axes. 

We  shall  suppose  that  the  axes  of  X,  T,  and  Z  are  related  in  the 
same  way  as  lines  drawn  to  the  right,  forward  and  upward,  so  that 
a  rotation  from  X  to  Y  appears  clockwise  to  one  looking  in  the 
direction  of  Z. 

Now  if  from  any  same  point,  as  the  origin  of  coordinates,  we  lay 
off  lines  representing  in  direction  and  magnitude  the  displacements 
in  all  the  different  wave-planes,  we  obtain  an  ellipse,  which  we  may 
call  the  <U8placement-ettip9e.*  Of  this,  one  radius  vector  (fa)  will 
have  the  components  av  &,  yx>  and  another  (/a,)  the  components 
<H>  fit*  Yr  These  will  belong  to  conjugate  diameters,  each  being 
parallel  to  the  tangent  at  the  extremity  of  the  other.  The  area  of 
the  ellipse  will  therefore  be  equal  to  the  parallelogram  of  which 
fa  and  fa  are  two  sides,  multiplied  by  it.  Now  it  is  evident  that 
fiiYt—Yifiv  yiai-"aiy«»  <*ifit~-aifi%  »re  numerically  equal  to  the  pro- 
jections of  this  parallelogram  on  the  planes  of  the  coordinate  axes, 
and  are  each  positive  or  negative  according  as  a  revolution  from 
fa  to  pt  appears  clockwise  or  counter-clockwise  to  one  looking  in 
the  direction  of  the  proper  coordinate  axis.  Hence,  6  will  be 
numerically  equal  to  the  parallelogram,  that  is,  to  the  area  of  the 
displacement-ellipse  divided  by  x,  and  will  be  positive  or  negative 

*  This  ellipse,  whioh  represents  the  simultaneous  displacements  in  different  parts  of 
the  field,  will  also  represent  the  suooessive  displacements  at  any  same  point  in  the 
corresponding  system  of  progressive  waves. 
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according  as  a  revolution  from  ft  to  ft  appears  clockwise  or  counter- 
clockwise to  one  looking  in  the  direction  of  the  wave-normal  Since 
ft  and  ft  are  determined  by  displacements  in  planes  one-quarter  of 
a  wave-length  distant  from  each  other,  and  the  plane  to  which  the 
latter  relates  lies  on  the  side  toward  which  the  wave-normal  is  drawn, 
it  follows  that  9  is  positive  or  negative  according  as  the  combination 
of  displacements  has  the  character  of  a  right-handed  or  a  left-handed 
screw. 

10.  The  kinetic  energy  of  the  medium,  which  is  to  be  estimated 
for  an  instant  of  no  displacement,  may  be  shown  as  in  §  7  of  the 
former  paper  (page  185  of  this  volume)  to  consist  of  two  parts,  of 
which  one  relates  to  the  regular  flux  (£,  rj,  £),  and  the  other  to  the 
irregular  flux  (£*,  if,  £ ).  The  first,  in  the  notation  of  that  paper,  is 
represented  by 

i/(£Pot  £  +fj  Pot  *+f  Pot  £)dv, 

which  reduces  to 

By  substitution  of  the  values  given  by  equations  (1),  we  obtain  for 
the  kinetic  energy  due  to  the  regular  flux  in  a  unit  of  volume 

T=^(a1*+/31Hy1*+aJ*+A«+yss).  (10) 

11.  The  kinetic  energy  of  the  irregular  part  of  the  flux  is  repre- 
sented by  the  volume-integral 

/i(f  Potf+jj'Potij'+^Potftcfo;. 

Now,  since  £  if,  %  are  everywhere  linear  functions  of  £,  tf,  f  and 
diff.  coeff.  (see  §  4),  and  since  the  integrations  implied  in  the  notation 
Pot  may  be  confined  to  a  sphere  of  which  the  radius  is  small  in 
comparison  with  a  wave  length,*  and  since  within  such  a  sphere  £,  9,  f 
and  diff.  coeff.  are  sufficiently  determined  (in  a  linear  form),  by  the 
values  of  the  same  twelve  quantities  at  the  center  of  the  sphere,  it 
f 0II0W8  that  Pot  £,  Pot  if9  Pot  f  must  be  linear  functions  of  the  values 
of  £,  1,  f  and  diff  coeff.  at  the  point  for  which  the  potential  is  sought. 
Hence, 

KfPot^+^PoM'  +  rPotO 

will  be  a  quadratic  function  of  £  17,  f  and  diff.  coeff.  But  the 
seventy-eight  coefficients  by  which  this  function  is  expressed  will 
vary  with  the  position  of  the  point  considered  with  respect  to  the 
surrounding  molecules. 


*  See  §  9  of  the  former  paper,  on  page  187  of  this  volume. 
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Yet,  as  in  the  case  of  the  statical  energy,  we  may  substitute  the 
average  values  of  these  coefficients  for  the  coefficients  themselves  in 
the  integral  by  which  we  obtain  the  energy  of  any  considerable  space. 
The  kinetic  energy  due  to  the  irregular  part  of  the  flux  is  thus 
reduced  to  a  quadratic  function  of  £,  rjy  £  and  diff.  coeff.  which  has 
constant  coefficients  for  a  given  medium  and  light  of  a  given  period. 

The  function  may  be  divided  into  three  parts,  of  which  the  first 
contains  the  squares  and  products  of  £  r\>  £  the  second  the  products 
of  £  9,  £  with  their  differential  coefficients,  and  the  third,  which  may 
be  neglected,  the  squares  and  products  of  the  differential  coefficients. 

We  may  proceed  with  the  reduction  precisely  as  in  the  case  of  the 
statical  energy,  except  that  the  differentiations  with  respect  to  the 

time  will  introduce  the  constant  factor  .—5-.     This  will  give  for 

the  first  part  of  the  kinetic  energy  of  the  irregular  flux  per  unit 
of  volume 

T'/=^(AV+BW+Cy1*+E'Ay1+Fy1«1+G'«1A) 

+^(AV+B'i8t*+Cy,*+E'i8,y,+Fyt«t+G'«i/8,),    (11) 
and  for  the  second  part  of  the  same 

=^'[L031y,-y1A)+M(y1«1-«1yi)+N(«^-j81«t)]J     (12) 

where  A',  B',  C,  E',  F,  Q'  are  constant,  and  $'  a  quadratic  function  of 
L,  M,  and  N,  for  a  given  medium  and  light  of  a  given  period. 
12.  Equating  the  statical  and  kinetic  energies,  we  have 
S,+S„-T+T',+T'w, 
that  is,  by  equations  (6),  (9),  (10),  (11),  and  (12), 

i(Aal«+B/3l*+Cy1*+Ei81y.+Fy1a1+G«li8l) 
+  i(Aat*+Bft*+(>y1*+EJ8ly,+Fysa1+Ga1&) 

+f  [MAtt-yiftH  M(y1a1-a1y2)+N(«1/9s-i8la1)] 

=|r<«1,+&,+ylf+V+&,+y,i) 

+^(A'a1«+B'J81«+Cy1«+E'Ay1+Fy1a1+G'«li81) 

+2^(AV+BW+Cy,*+E'/32yi+Fyt«t+G'a1A) 

+^T  [L08.y«-yii8.)+M(y1at-«1yJ)+N(«1y3,-/3la1)].    (13) 
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If  we  set  A     2wA'      ,      B     2»B'       .  ,1Av 

a=2i—pr>    b=to—pr'   cte-  (U) 

and  ^____,  (15) 

the  equation  reduces  to 

+aat*+bp*+cy*+eptyt+fYiat+gatpt 

+  ^[L(/3lyt-yli8t)+M(y1a1-a1y1)+N(«li8i-A^] 

-p(«lt+Pi*+Yi,+°tt+&i+Y*t),  (16) 

where  a,  b,  c,  e,  /,  g  are  constant,  and  <j>  a  quadratic  function  of 
L,  M,  N,  for  a  given  medium  and  light  of  a  given  period. 

13.  Now  this  equation,  which  expresses  a  relation  between  the 
constants  of  the  equations  of  wave-motion  (1),  will  apply,  with  those 
equations,  not  only  to  such  vibrations  as  actually  take  place,  but  also 
to  such  as  we  may  imagine  to  take  place  under  the  influence  of 
constraints  determining  the  type  of  vibration.  The  free  or  uncon- 
strained vibrations,  with  which  alone  we  are  concerned,  are  charac- 
terized by  this,  that  infinitesimal  variations  (by  constraint)  of  the 
type  of  vibration,  that  is,  of  the  ratios  of  the  quantities  alt  ft,  yt, 
<**»  fit>  Vt>  W*U  not  affect  the  period  by  any  quantity  of  the  same 
order  of  magnitude*  These  variations  must  however  be  consistent 
with  equations  (4),  which  require  that 

Ld^+Mdft+Ndy^O,  Ldoj+Mdft+Ndy^O.  (17) 
Hence,  to  obtain  the  conditions  which  characterize  free  vibration, 
we  may  differentiate  equation  (16)  with  respect  to  c^,  ft,  yv  a,,  ft,  yt, 
regarding  all  other  letters  as  constant,  and  give  to  dc^,  dft,  dyv 
da^,  dft,  dyv  such  values  as  are  consistent  with  equations  (17). 
Now  dav  dft,  dyv  are  independent  of  dat,  dft,  dyt,  and  for  either 
three  variations,  values  proportional  either  to  av  ft,  yv  or  to  a,,  ft,  yv 
are  possible.  If,  then,  we  differentiate  equation  (16)  with  respect  to 
ai»  ft,  yv  and  substitute  first  ati  ft,  yx,  and  then  a2,  ft,  yt,  for 
dalt  dft,  dylt  and  also  differentiate  with  respect  to  av  ft,  yv  with 
similar  substitutions,  we  shall  obtain  all  the  independent  equations 
which  this  principle  will  yield. 

If  we  differentiate  with  respect  to  av  ft,  yv  and  write  av  ft,  yx 
for  dalt  dft,  dylt  we  obtain 

aa*+bp*+cy*+ePly1+fy1a1+ga1Pl 

+^[L(Ay«-yii82)+M(y1a,-a1y1)+N(ali82-ftat)] 

-Jw+Af+^  (18) 


*  Compare  1 11  of  the  former  paper,  page  189  of  this  volume. 
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If  we  differentiate  with  respect  to  ax,  /3V  ylt  and  write  a,,  &,  y%  for 
dalt  dfa,  dyv  we  obtain 

2oa1aJ+26)8li88+2(^1yJ+«08ly1+y^)+/(y1a,+a1y1) 

on 

+9(«A+Pi<h)=y  («i«i+ AA+ririX  (19) 

If  we  differentiate  with  respect  to  a,,  &,  y„  and  write  a,,  &,  yt  for 
ci<4,  d)8„  dy2,  we  obtain 

+1>?[L(/81y1-y1i81)+M(y1at-a1y1)+N(aA-Aat)] 

-pW+M+YJ)-  (20) 

The  equation  derived  by  differentiating  with  respect  to  a,,  ft,  y2,  and 
writing  alf  ft,  yx  for  dafi  d/82,  dy,,  is  identical  with  (19).  We  should 
also  observe  that  equations  (18)  and  (20)  by  addition  give  equation 
(16),  which  therefore  will  not  need  to  be  considered  in  addition  to 
the  last  three  equations. 

14.  The  geometrical  signification  of  our  equations  may  now  be 
simplified  by  a  suitable  choice  of  the  position  of  the  origin  of 
coordinates,  which  is  as  yet  wholly  arbitrary. 

We  shall  hereafter  suppose  that  the  origin  is  placed  in  a  plane  of 
maximum  or  minimum  displacement*  if  such  there  are.  In  the  case 
of  circular  polarization,  in  which  the  displacements  are  everywhere 
equal,  its  position  is  immaterial.  The  lines  /^  and  p^t  of  which  alt  /3lt 
yx  and  a^  fiv  y,  are  respectively  the  components,  will  now  be  the 
semi-axes  of  the  displacement-ellipse,  and  therefore  at  right  angles. 
(See  §  9.)  The  case  of  circular  polarization  will  not  constitute  any 
exception.    Hence, 

"i^+Aft+y^-O,  (21) 

and  by  §  9, 

9  =  L(i81yl-yli8J)+M(y1a1-.a1y1)+N(ali82-i8ia2)=  ±hP»     (22) 

where  we  are  to  read  +  or  —  in  the  last  member  according  as  the 
system  of  displacements  has  the  character  of  a  right-handed  or  a 
left-handed  screw. 

15.  Equation  (19)  is  now  reduced  to  the  form 

2oa1of+26/81i81+2cy1y,+e(/81y2+y1)82) 

+f(yi<h+«iy*)+9(«i&+Pi<h)=<>>       (23) 

•  The  reader  will  perceive  that  an  earlier  limitation  of  the  position  of  the  origin  by  a 
supposition  of  this  nature,  involving  a  limitation  of  the  values  of  a^  ft,  7,,  a,,  /S,,  y^f 
would  have  been  embarrassing  in  the  operations  of  the  last  paragraph. 
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which  has  a  very  simple  geometrical  signification.  If  we  consider 
the  ellipsoid  ax*+by*+cz*+eyz+fzx+gxy,  (24) 

and  especially  its  central  section  by  a  plane  parallel  to  the  planes  of 
the  wave-system  which  we  are  considering,  it  will  easily  appear  that 
the  equation 

2aiB1JB2+26y1y,+2c010t+«(y1«2+01yf) 

will  hold  of  any  two  points  xv  yv  zx  and  x%9  yt,  zt  which  belong  to 
conjugate  diameters  of  this  central  section.  Therefore  equation  (23) 
expresses  that  the  displacements  alt  /3lt  yx  and  04,  /8|,  yt  are  parallel 
to  conjugate  diameters  of  the  central  section  of  the  ellipsoid  (24)  by  a 
wave-plane.  But  since  the  displacements  alt  #p  yx  and  a,,  fa,  yt  are 
also  at  right  angles  to  each  other,  it  follows  that  they  are  parallel 
to  the  axes  of  the  central  section  of  the  ellipsoid  (24)  by  a  wave-plane. 
That  is: — The  axes  of  the  displacement-ellipse  coincide  in  direction 
with  those  of  a  central  section  of  the  ellipsoid  (24)  by  a  wave-plane. 

16.  If  we  write  TJX,  Ut  for  the  reciprocals  of  the  semi-axes  of  the 
central  section  of  the  ellipsoid  (24)  by  a  wave-plane,  Vx  being  the 
reciprocal  of  the  one  to  which  the  displacement  C4,  &,  yx  is  parallel, 
we  have 

as  is  at  once  evident  if  we  substitute  the  coordinates  of  an  extremity 
of  the  axis  for  the  proportional  quantities  av  &,  yv    So  also 

oa^+W+cy.He&y,*^^^  (26) 

If  we  write  V  for  the  velocity  of  propagation  of  the  system  of 
progressive  waves  corresponding  to  the  system  of  stationary  waves 
which  we  have  been  considering,  we  shall  have 

V-l  (27) 

P 

By  equations  (22),  (25),  and  (26),  equations  (18)  and  (20)  are 

reduced  to  the  form 

UxWi^-W,        <W±$ft*«VW.  (28) 

where  we  are  to  read  +  or  —  according  as  the  disturbance  has  the 
character  of  a  right-handed  or  a  left-handed  screw.  In  a  progressive 
system  of  waves,  when  the  combination  of  displacements  has  the 
character  of  a  right-handed  screw,  the  rotations  will  be  such  as  appear 
clockwise  to  the  observer,  who  looks  in  the  direction  opposite  to  that 
of  the  propagation  of  light.  We  shall  call  such  a  ray  right-handed. 
We  may  here  observe  that  in  case  0=0  the  solution  of  these 
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equations  is  very  simple.  We  have  necessarily  either  pss0  and 
V'sU!*,  or  px^Q  and  V^Uj*.  In  this  case,  the  light  is  linearly 
polarized,  and  the  directions  of  oscillation  and  the  velocities  of 
propagation  are  given  by  Fresnel's  law.  Experiment  has  shown  that 
this  is  the  usual  case.  We  wish,  however,  to  investigate  the  case  in 
which  <p  does  not  vanish.  Since  the  term  containing  <f>  arises  from 
the  consideration  of  those  quantities  which  it  was  allowable  to  neglect 
in  the  first  approximation,  we  may  assume  that  <p  is  always  very 
small  in  comparison  with  V8,  Uj8,  or  Uj*. 
17.  Equations  (28)  may  be  written 

y-U,*-*!*      V*-Ut«=±!g.  (29) 

By  multiplication  we  obtain 

V«(V2-U1l)(V*-UJ«)=^.  (30) 

Since  <p  is  a  very  small  quantity,  it  is  evident  from  inspection  of  this 
equation  that  it  will  admit  three  values  of  Vs,  of  which  one  will  be  a 
very  little  greater  than  the  greater  of  the  two  quantities  U^  and  U2*, 
another  will  be  a  very  little  less  than  the  less  of  the  same  two  quan- 
tities, and  the  third  will  be  a  very  small  quantity.  It  is  evident  that 
the  values  of  Vs  with  which  we  have  to  do  are  those  which  differ  but 
little  from  Uj*  and  U,2.* 

For  the  numerical  computation  of  V,  when  U1,  U,,  and  <f>  are 
known  numerically,  we  may  divide  the  equation  by  Vs,  and  then 
solve  it  as  if  the  second  member  were  known.    This  will  give 

T.-3^^£?SZ=SE  (3D 

By  substituting  VJJt  for  V*  in  the  second  member,  we  may  obtain 
a  close  approximation  to  the  two  values  of  V1.  Each  of  the  values 
obtained  may  be  improved  by  substitution  of  that  value  for  Vs  in  the 
second  member  of  the  equation. 

For  either  value  of  V2,  we  may  easily  find  the  ratio  of  ft  to  pt, 
that  is,  the  ratio  of  the  axes  of  the  displacement-ellipse,  from  one  of 
equations  (29),  or  from  the  equation 

%=w=rfr  <32> 

obtained  by  combining  the  two. 

•  We  should  not  attribute  any  physical  significance  to  the  third  value  of  V.  For 
this  value  would  imply  a  wave-length  very  small  in  comparison  with  the  length  of 
ordinary  waves  of  light,  and  with  respect  to  which  our  fundamental  assumption  that 
the  wave-length  is  very  great  in  comparison  with  the  distances  of  contiguous  molecules 
would  be  entirely  false.  Our  analysis,  therefore,  furnishes  no  reason  for  supposing  that 
any  suoh  velocities  are  possible  for  the  propagation  of  electrical  disturbances. 
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In  equations  (29),  we  are  to  read  +  or  —  in  the  second  members, 
according  as  the  ray  is  right-handed  or  left-handed.  (See  §  16.)  It 
follows  that  if  the  value  of  <p  is  positive,  the  greater  velocity  will 
belong  to  a  right-handed  ray,  and  the  smaller  to  a  left-handed,  but  if 
the  value  of  <p  is  negative,  the  opposite  is  the  case.  Except  when 
0  =  0,  and  the  polarization  is  linear,  there  will  be  one  right-handed 
and  one  left-handed  ray  for  any  given  wave-normal  and  period. 

18.  When  U^H,,  equations  (29)  give 

where  U  represents  the  common  value  of  Uj  and  Us.  The  polariza- 
tion is  therefore  circular.  The  converse  is  also  evident  from  equations 
(29),  viz.,  that  a  ray  can  be  circularly  polarized  only  when  the 
direction  of  its  wave-normal  is  such  that  U1=sUs.  Such  a  direction, 
which  is  determined  by  a  circular  section  of  the  ellipsoid  (24)  precisely 
as  an  optic  axis  of  a  crystal  which  conforms  to  Fresners  law  of  double 
refraction,  may  be  called  an  optic  axis,  although  its  physical  pro- 
perties are  not  the  same  as  in  the  more  ordinary  case*  If  we  write 
VE  and  VL,  respectively,  for  the  wave-velocities  of  the  right-handed 
and  left-handed  rays,  we  have 

VB'=U»+£,     V^-TP-^;  (33) 

whence  vE'-Vt»=0(^+^-)=*^±£, 

and  va-VL-^  (34) 

The  phenomenon  best  observed  with  respect  to  an  optic  axis  is  the 
rotation  of  the  plane  of  linearly  polarized  light.  If  we  denote  by  0 
the  amount  of  this  rotation  per  unit  of  the  distance  traversed  by  the 
wave-plane,  regarding  it  as  positive  when  it  appears  olockwise  to  the 

*  Our  experimental  knowledge  of  oiroulsrly  or  elliptioally  polarising  media  is  confined 
to  soon  as  are  optically  either  isotropic  or  uniaxial  The  general  theory  of  suoh  media, 
embracing  the  oase  of  two  optio  axes,  has  however  been  discussed  by  Professor  von  Lang 
("Theorie  der  Ciroularpolarization,"  Sitz.-Ber.  Wiener  A  lead.,  voL  lxxv,  p.  719).  The 
general  results  of  the  present  paper,  although  derived  from  physical  hypotheses  of  an 
entirely  different  nature,  are  quite  similar  to  those  of  the  memoir  cited.    They  would 

become  identical,  the  writer  believes,  by  the  substitution  of  a  constant  f or  ?P  or  £ 
in  the  equations  of  this  paper.    (8ee  especially  equations  (18),  (20),  (28).) 

That  a  complete  discussion  of  the  subject  on  any  theory  must  inolude  the  oase  of 
biaxial  media  having  the  property  of  oiroular  or  elliptical  polarisation,  is  evident  from 
the  consideration  that  it  must  at  least  be  possible  to  produce  examples  of  suoh  media 
artificially.  An  isotropio  or  uniaxial  crystal  may  be  made  biaxial  by  pressure.  If  it 
has  the  property  of  oiroular  and  elliptio  polarisation,  that  property  cannot  be  wholly 
destroyed  by  the  application  of  small  pressures. 
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observer,  who  looks  in  the  direction  opposite  to  that  of  the  propagation 
of  the  light,*  we  have 


°S  P^l 


By  the  preceding  equation,  this  reduces  to 

e-rfk>-  (86> 

Without  any  appreciable  error,  we  may  substitute  U4  for  VB*VL5, 
which  will  givet 

•-#•  <37> 

19.  Since  these  equations  involve  unknown  functions  of  the  period 
they  will  not  serve  for  an  exact  determination  of  the  relation  between 
9  and  the  period.  For  a  rough  approximation,  however,  we  may 
assume  that  the  manner  in  which  the  general  displacement  in  any 
small  part  of  the  medium  distributes  itself  among  the  molecules  and 
intermolecular  spaces  is  independent  of  the  period,  being  determined 
entirely  by  the  values  of  £,  9,  £  and  their  differential  coefficients  with 
respect  to  the  coordinates.  {  For  a  fixed  direction  of  the  wave-normal, 
$  and  $'  will  then  be  constant    Now  equations  (15)  and  (36)  give 

e'2j^^h*-p^7^-  (88) 

To  express  this  result  in  terms  of  the  quantities  directly  observed,  we 
may  use  the  equations 

k  n±  nh  n 

where  k  denotes  the  velocity  of  light  in  vacuo,  X  the  wave-length 
in  vacuo  of  the  light  employed,  n*,  n^  the  absolute  indices  of  refrac- 
tion of  the  two  rays,  and  n  the  index  for  the  optic  axis  as  derived 
from  the  ellipsoid  (24)  by  Fresnel's  law.    We  thus  obtain 

d=^m* v — •  (39) 

*  When  the  rotation  of  the  plane  of  polarization  appears  clockwise  to  the  observer, 
it  has  the  oharaoter  of  a  left-handed  screw.  Bat  the  circularly  polarized  ray  to  whioh 
Vft  relates,  the  rotation  of  whioh  also  appears  clockwise  to  the  observer,  has  the 
oharaoter  of  a  right-handed  screw. 

t  The  degree  of  aoouraoy  of  this  substitution  may  be  shown  as  follows.    By  (38) 
V.OV-U^VittP-VA 
whence  V.'+V^Vj  +  V.jU*, 

VmVt=U«-(Va-V1)«. 
X  Compare  1 12  of  the  former  paper,  on  page  189  of  this  volume. 
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In  the  case  of  uniaxial  crystals,  the  direction  of  the  optic  axis  is 
fixed.    We  may  therefore  write 

O-Vn,.4  (§+§),  (40) 

regarding  E  and  K'  as  constants.  If  we  had  used  equation  (37),  we 
should  have  had  the  factor  nA  instead  of  **»*%*.  Since  this  factor 
varies  but  slowly  with  X,  it  may  be  neglected,  if  its  omission  is 
compensated  in  the  values  of  E  and  E'.  The  formula  being  only 
approximative,  such  a  simplification  will  not  necessarily  render  it 
less  accurate. 

20.  But  without  any  such  assumption  as  that  contained  in  the 
last  paragraph,  we  may  easily  obtain  formulas  for  the  experimental 
determination  of  $  and  $'  for  the  optic  axis  of  a  uniaxial  crystal 
Considerations  analogous  to  those  of  §  13  of  the  former  paper 
(page  190  of  this  volume),  show  that  in  differentiating  equation  (39) 
we  may  regard  $  and  $'  as  constant,  although  they  may  actually 
vary  with  X.    This  equation  may  be  written 

i?-w — w  (41) 


Therefore, 


<© 


1-2**$'.  (42) 


When  $'  has  been  determined  by  this  equation,  $  may  be  found 
from  the  preceding. 

21.  If  we  wish  to  represent  <f>  geometrically,  like  l^  and  Us,  we 
may  construct  the  surfaces 

Aajt+By1+c«8+Ey«+F«c+Ga^=  ±1,  (43) 

the  coefficients  a,  b,  etc.,  being  the  same  by  which  <f>  is  expressed  in 
terms  of  L*,  M*,  etc  The  numerical  value  of  <f>,  for  any  direction 
of  the  wave-normal,  will  thus  be  represented  by  the  square  of  the 
reciprocal  of  the  radius  vector  of  the  surface  drawn  in  the  same 
direction.  The  positive  or  negative  character  of  <j>  must  be  separately 
indicated.  There  are  here  two  cases  to  be  distinguished.  If  the 
sign  of  <f>  is  the  same  in  all  directions,  the  surface  will  be  an 
ellipsoid,  and  we  have  only  to  know  whether  all  the  values  of  <f>  are 
to  be  taken  positively  or  all  negatively.  But  if  <f>  is  positive  for  some 
directions  and  negative  for  others,  the  surface  will  consist  of  two 
conjugate  hyperboloids.  to  one  of  which  the  positive,  and  to  the  other 
the  negative  values  belong. 
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22.  The  manner  in  which  the  ellipsoid  (24)  may  be  partially 
determined  by  the  relations  of  symmetry  which  the  medium  may 
possess,  has  been  sufficiently  discussed  in  the  former  paper. 

With  respect  to  the  quantity  <j>,  and  the  surfaces  which  determine 
it,  the  following  principle  is  of  fundamental  importance.  If  one  body 
is  identical  in  its  internal  structure  with  the  image  by  reflection  of 
another,  the  values  of  0  in  corresponding  lines  in  the  two  bodies  will 
be  numerically  equal  but  have  opposite  signs.* 

It  follows  that  if  a  body  is  identical  in  internal  structure  with  its 
own  image  by  reflection,  the  value  of  <f>  (if  not  zero  for  all  directions) 
must  be  positive  for  some  directions  and  negative  for  others.  More- 
over, the  above  described  surface  by  which  <f>  is  represented  must 
consist  of  two  conjugate  hyperboloids,  of  which  one  is  identical  in 
form  with  the  image  by  reflection  of  the  other.  This  requires  that 
the  hyperboloids  shall  be  light  cylinders  with  conjugate  rectangular 
hyperbolas  for  bases.  A  crystal  characterized  by  such  properties  will 
belong  to  the  tetragonal  system.  Since  0  —  0  for  the  optic  axis,  it 
would  be  difficult  to  distinguish  a  case  of  this  kind  from  an  ordinary 
uniaxial  crystal,  unless  the  ellipsoid  (24)  should  approach  very  closely 
to  a  sphere,  t 

It  is  only  in  the  very  limited  case  described  in  the  last  paragraph 
that  a  medium  which  is  identical  in  its  internal  structure  with  its 
image  by  reflection  can  have  the  property  of  circular  or  elliptic 
polarization.  To  media  which  are  unlike  their  images  by  reflection, 
and  have  the  property  of  circular  polarization,  we  may  apply  the 
following  general  principles. 

If  the  medium  has  any  axis  of  symmetry,  the  ellipsoid  or  hyper- 
boloids which  represent  the  values  of  <p  will  have  an  axis  in  the  same 
direction.  If  the  medium  after  a  revolution  of  less  than  180°  about 
any  axis  is  equivalent  to  the  medium  in  its  first  position,  the  ellipsoid 
or  hyperboloids  will  have  an  axis  of  revolution  in  that  direction. 

23.  The  laws  of  the  propagation  of  light  in  plane  waves,  which 


*The  necessity  of  the  opposite  signs  will  perhaps  appear  most  readily  from  the 
consideration  that  the  direotion  of  rotation  of  the  plane  of  polarization  must  be  opposite 
in  the  two  bodies. 

t  There  is  no  difficulty  in  conceiving  of  the  constitution  of  a  body  whioh  would  have 
the  properties  described  above.  Thusi  we  may  imagine  a  body  with  molecules  of  a 
spiral  form,  of  whioh  one-half  are  right-handed  and  one-half  left-handed,  and  we  may 
suppose  that  the  motion  of  electricity  is  opposed  by  a  less  resistance  within  them  than 
without.  If  the  axes  of  the  right-handed  molecules  are  parallel  to  the  axis  of  X,  and 
those  of  the  left-handed  molecules  to  the  axis  of  Y,  their  effects  would  counterbalance 
one  another  when  the  wave-normal  is  parallel  to  the  axis  of  Z.  But  when  the  wave- 
normal  (of  a  beam  of  linearly  polarized  light)  is  parallel  to  the  axis  of  X,  the  left-handed 
molecules  would  produce  a  left-handed  (negative)  rotation  of  the  plane  of  polarization, 
the  right-handed  molecules  having  no  effect ;  and  when  the  wave-normal  is  parallel  to 
the  axis  of  Y,  the  reverse  would  be  the  case. 
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have  thus  been  derived  from  the  single  hypothesis  that  the  disturb- 
ance by  which  light  is  transmitted  consists  of  solenoidal  electrical 
fluxes,  and  which  apply  to  light  of  different  colors  and  to  the  most 
general  case  of  perfectly  transparent  and  sensibly  homogeneous  media 
not  subject  to  magnetic  action*  are  essentially  those  which  are 
generally  received  as  embodying  the  results  of  experiment  In' no 
particular,  so  far  as  the  writer  is  aware,  do  they  conflict  with  the 
results  of  experiment,  or  require  the  aid  of  auxiliary  and  forced 
hypotheses  to  bring  them  into  harmony  therewith. 

In  this  respect,  the  electromagnetic  theory  of  light  stands  in  marked 
contrast  with  that  theory  in  which  the  properties  of  an  elastic  solid 
are  attributed  to  the  ether,-J— a  contrast  which  was  very  distinct  in 
Maxwell's  derivation  of  Fresnel's  laws  from  electrical  principles,  but 
becomes  more  striking  as  we  follow  the  subject  farther  into  its  details, 
and  take  account  of  the  want  of  absolute  homogeneity  in  the  medium, 
so  as  to  embrace  the  phenomena  of  the  dispersion  of  colors  and  circular 
and  elliptical  polarization. 


*  The  rotation  of  the  plane  of  polarization  whioh  is  produced  by  magnetic  action  hat 
been  discussed  by  Maxwell  (Treatise  on  Electricity  and  Magnetism,  voL  ii,  ohap.  xxi), 
and  by  Rowland  (Amer,  Joum.  Math.,  voL  iii,  p.  107). 
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ON  THE  GENERAL  EQUATIONS  OF  MONOCHROMATIC  LIGHT 
IN  MEDIA  OF  EVERY  DEGREE  OF  TRANSPARENCY. 

[American  Journal  of  Science,  ser  3,  vol.  xxv,  pp.  107-118,  February,  1883.] 

1.  The  last  April  and  June  numbers  of  this  Journal*  contain  an 
investigation  of  the  velocity  of  plane  waves  of  light,  in  which  they 
are  regarded  as  consisting  of  solenoidal  electrical  fluxes  in  an  indefi- 
nitely extended  medium  of  uniform  and  very  fine-grained  structure. 
It  was  also  supposed  that  the  medium  was  perfectly  transparent, 
although  without  discussion  of  the  physical  properties  on  which 
transparency  depends,  and  that  the  electrical  motions  were  not 
complicated  by  any  distinctively  magnetic  phenomena. 

In  the  present  paper  t  the  subject  will  be  treated  with  more 
generality,  so  as  to  obtain  the  general  equations  of  monochromatic 
light  for  media  of  every  degree  of  transparency,  whether  sensibly 
homogeneous  or  otherwise,  which  have  a  very  fine-grained  molecular 
structure  as  measured  by  a  wave-length  of  light  There  will  be  no 
restriction  with  respect  to  magnetic  influence,  except  that  an  oscil- 
lating magnetization  of  the  medium  will  be  excluded. } 

In  order  to  conform  as  much  as  possible  to  the  ordinary  view  of 

*  See  pages  182-194  and  196-210  of  this  volume. 

fThis  paper  contains,  with  some  additional  developments,  the  substance  of  a  com- 
munication to  the  National  Academy  of  Sciences  in  November,  1882. 

%  Where  a  body  capable  of  magnetisation  is  subjected  to  the  influence  of  light  (as 
when  light  is  reflected  from  the  surface  of  iron),  there  are  two  simple  hypotheses  which 
present  themselves  with  respeot  to  the  magnetio  state  of  the  body.  One  is  that  the 
magnetic*  forces  due  to  the  light  are  not  of  sufficient  duration  to  allow  the  molecular  v 
changes  whioh  constitute  magnetization  to  take  place  to  any  sensible  extent.  The  other 
is  that  the  magnetization  has  a  constant  ratio  to  the  magnetio  force  without  regard  to 
its  duration.  We  might  easily  make  a  more  general  hypothesis  whioh  would  embrace 
both  of  those  mentioned  as  extreme  oases,  and  which  would  be  irreproachable  from  a 
theoretical  stand-point ;  but  it  would  complicate  our  equations  to  a  degree  whioh  would 
not  be  compensated  by  their  greater  generality,  since  no  phenomena  depending  on  suoh 
magnetization  have  been  observed,  so  far  as  the  writer  is  aware,  or  are  likely  to  be, 
except  in  a  very  limited  olass  of  cases. 

For  the  purposes  of  this  paper,  therefore,  it  has  seemed  better  to  exclude  media 
capable  of  magnetization,  except  so  far  as  the  first  mentioned  hypothesis  may  be  appli- 
cable. But  it  does  not  appear  that  this  requires  us  to  exclude  oases  in  whioh  the  medium 
is  subject  to  the  influence  of  a  permanent  magnetio  force,  suoh  as  produces  the 
phenomenon  of  the  magnetio  rotation  of  the  plane  of  polarisation. 
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electrical  phenomena,*  we  shall  not  introduce  at  first  the  hypothesis 
of  Maxwell  that  electrical  fluxes  are  solenoidaLt  Our  results,  however, 
will  be  such  as  to  require  us  to  admit  the  substantial  truth  of  this 
hypothesis,  if  we  regard  the  processes  involved  in  the  transmission  of 
light  as  electrical 

With  regard  to  the  undetermined  questions  of  electrodynamic  in- 
duction, we  shall  adopt  provisionally  that  hypothesis  which  appears  the 
most  simple,  yet  proceed  in  such  a  manner  that  it  will  be  evident  exactly 
how  our  results  must  be  altered,  if  we  prefer  any  other  hypothesis. 

Electrical  quantities  will  be  treated  as  measured  in  electromagnetic 
units. 

2.  We  must  distinguish,  as  before,  between  the  actual  electrical 
displacements,  which  are  too  complicated  to  follow  in  detail  with 
analysis,  and  which  in  their  minutiae  elude  experimental  demonstration, 
and  the  displacements  as  averaged  for  spaces  which  are  large  enough 
to  smooth  out  their  minor  irregularities,  but  not  so  large  as  to  oblite- 
rate to  any  sensible  extent  those  more  regular  features  of  the  electrical 
motion,  which  form  the  subject  of  optical  experiment  These  spaces 
must  therefore  be  large  as  measured  by  the  least  distances  between 
molecules,  but  small  as  measured  by  a  wave-length  of  light.  We 
shall  also  have  occasion  to  consider  similar  averages  for  other  quan- 
tities, as  electromotive  force,  the  electrostatic  potential,  etc  It  will 
be  convenient  to  suppose  that  the  space  for  which  the  average  is 
taken  is  the  same  in  all  parts  of  the  field,!  say  a  sphere  of  uniform 
radius  having  its  center  at  the  point  considered. 

Whatever  may  be  the  quantities  considered,  such  averages  will  be 
represented  by  the  notation 

t      w 

*  It  hM,  perhaps,  retarded  the  acceptance  of  the  electromagnetic*  theory  of  light  that 
it  was  presented  in  oonneotion  with  a  theory  of  electrical  action,  whioh  is  probably  more 
diffioult  to  prove  or  disprove,  and  oertainly  presents  more  difficulties  of  comprehension, 
than  the  connection  of  optical  and  electrical  phenomena,  and  whioh,  as  resting  largely 
on  a  priori  considerations,  must  naturally  appear  very  differently  to  different  minds. 
Moreover,  the  mathematical  method  by  whioh  the  subject  was  treated,  while  it  will 
remain  a  striking  monument  of  its  author's  originality  of  thought,  and  profoundly 
modify  the  development  of  mathematical  physics,  must  nevertheless,  by  its  wide  depar- 
ture from  ordinary  methods,  have  tended  to  repel  such  as  might  not  make  it  a  matter 
of  serious  study. 

t  A  flux  is  said  to  be  aolenoidai  when  it  satisfies  the  conditions  whioh  characterise 
the  motion  of  an  incompressible  fluid,— in  other  words,  if  u,  v,  w  are  the  rectangular 
components  of  the  flux,  when 

du    dv    dw    n 

and  the  normal  component  of  the  flux  is  the  same  on  both  sides  of  any  surfaces  of 
discontinuity  whioh  may  exist. 

♦  This  is  rather  to  fix  our  ideas,  than  on  aooount  of  any  mathematical  neoeesity.  For 
the  space  for  whioh  the  average  is  taken  may  in  general  be  considerably  varied  without 
sensibly  affecting  the  value  of  the  average. 
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If,  then,  £  9,  £  denote  the  components  of  the  actual  displacement  at 
the  point  considered, 

[£U>  OU,  [&~ 

will  represent  the  average  values  of  these  components  in  the  small 
sphere  about  that  point  These  average  values  we  shall  treat  as 
functions  of  the  coordinates  of  the  center  of  the  sphere  and  of  the 
time,  and  may  call  them,  for  brevity,  the  average  values  of  £  9,  £ 
But  however  they  may  be  designated,  it  is  essential  to  remember  that 
it  is  a  space-average  for  a  certain  very  small  space,  and  never  a  time- 
average,  that  is  intended. 

The  object  of  this  paper  will  be  accomplished  when  we  have 
expressed  (explicitly  or  implicitly)  the  relations  which  subsist 
between  the  values  of  [£]*„,  [tfli™,  [£]a-»>  &t  different  times  and  in 
different  parts  of  the  field, — in  other  words,  when  we  have  found 
the  conditions  which  these  quantities  must  satisfy  as  functions  of 
the  time  and  the  coordinates. 

3.  Let  us  suppose  that  luminous  vibrations  of  any  one  period  *  are 
somewhere  excited,  and  that  the  disturbance  is  propagated  through 
the  medium.  The  motions  which  are  excited  in  any  part  of  the 
medium,  and  the  forces  by  which  they  are  kept  up,  will  be  expressed 
by  harmonic  functions  of  the  time,  having  the  same  period,t  as  may 
be  proved  by  the  single  principle  of  the  superposition  of  motions 
quite  independently  of  any  theory  of  the  constitution  of  the  medium, 
or  of  the  nature  of  the  motions,  as  electrical  or  otherwise.  This  is 
equally  true  of  the  actual  motions,  and  of  the  averages  which  we  are 
to  consider.    We  may  therefore  set 


[£]AT*-CH<^y'+^^y'>l  (i) 


2tt_        .    2^ 

50fT 

eta, 


*  There  is  110  real  loss  of  generality  in  making  the  light  monoohromatio,  sinoe  in  every 
ease  it  may  be  divided  into  parte,  whioh  are  separately  propagated,  and  each  of  whioh 
is  monoohromatio  to  any  required  degree  of  approximation. 

t  It  is  of  course  possible  that  the  expressions  for  the  forces  and  displacements  should 
have  constant  terms.  But  these  will  disappear,  if  the  displacements  are  measured  from 
the  state*  of  equilibrium  about  whioh  the  system  vibrates,  and  we  leave  out  of  account 
in  measuring  the  foroes  (and  the  eleotrostatio  potential)  that  whioh  would  belong  to  the 
system  in  the  state  of  equilibrium.  To  prevent  misapprehension,  it  should  be  added 
that  the  term  electrical  displacement  is  not  used  in  the  restricted  sense  of  dielectric 
displacement  or  polarization.  The  variation  of  the  eleotrioal  displacement,  as  the  term 
is  used  in  this  paper,  constitutes  what  Maxwell  calls  the  total  motion  of  eleotrioity  or 
true  current,  and  what  he  divides  into  two  parts,  whioh  he  distinguishes  as  the  current 
of  oonduotion  and  the  variation  of  the  eleotrioal  displacement.  Such  a  division  of 
the  total  motion  of  eleotrioity  is  not  necessary  for  the  purposes  of  this  paper,  and 
the  term  displacement  is  used  with  reference  to  the  total  motion  of  eleotrioity  in  a 
manner  entirely  analogous  to  that  in  whioh  the  term  is  ordinarily  used  in  the  theory  of 
wave-motion. 
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where  t  denotes  the  time,  p  the  period,  and  04,  Oj,  functions  of  the 
coordinates.    It  follows  that 


[f  jATe=» I3"[£]a?» 


i 


(2) 

etc 

4.  Now,  on  the  electrical  theory,  these  motions  are  excited  by 
electrical  forces,  which  are  of  two  kinds,  distinguished  as  electro- 
static and  electrodynamic.  The  electrostatic  force  is  determined  by 
the  electrostatic  potential  If  we  write  q  for  the  actual  value  of  the 
potential,  and  [g]Are  for  its  value  as  averaged  in  the  manner  specified 
above,  the  components  of  the  actual  electrostatic  force  will  be 


and  for  the  average  values  of  these  components  in  the  small  spaces 
described  above  we  may  write 

<3[?]at.  <*[?]at.  <*[g]AT» 

ax    *  dy    9  da    * 

for  it  will  make  no  difference  whether  we  take  the  average  before 
or  after  differentiation. 

5.  The  electrodynamic  force  is  determined  by  the  acceleration  of 
electrical  flux  in  all  parts  of  the  field,  but  physicists  are  not  entirely 
agreed  in  regard  to  the  laws  by  which  it  is  determined.  This 
difference  of  opinion  is  however  of  less  importance,  since  it  will  not 
affect  the  result  if  electrical  fluxes  are  always  solenoidal.  According 
to  the  most  simple  law,  the  components  of  the  force  are  given  by 
the  volume-integrals 

-#«•  -%>■  -jn$* 

where  dv  represents  an  element  of  volume,  and  r  the  distance  of  this 
element  from  the  point  for  which  the  value  of  the  electromotive 
force  is  to  be  determined.  In  other  words,  the  components  of  the 
force  at  any  point  are  determined  from  the  components  of  acceleration 
in  all  parts  of  the  field  by  the  same  process  by  which  (in  the  theories 
of  gravitation,  etc.)  the  value  of  the  potential  at  any  point  is  deter- 
mined from  the  density  of  matter  in  all  parts  of  space,  except  that 
the  sign  is  to  be  reversed.  Adopting  this  law,  provisionally  at  least, 
we  may  express  it  by  saying  that  the  components  of  electrodynamic 
force  are  equal  to  the  potentials  taken  negatively  of  the  components 
of  acceleration  of  electrical  flux.    And  we  may  write,  for  brevity, 

-Pot£         -Pot^,         -Pot£ 
for  the  components  of  force,  using  the  symbol  Pot  to  denote  the 
operation  by  which  the  potential  of  a  mass  is   derived  from   its 
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density.    For  the  average  values  of  these  components  in  the  small 
spaces  defined  above,  we  may  write 

-Pot[&„,        -Pot[iJW        -Pot[&w 

since  it  will  make  no  difference  whether  we  take  the  average  before 
or  after  the  operation  of  taking  the  potential 

6.  If  we  write  X,  Y,  Z  for  the  components  of  the  total  electromotive 
force  (electrostatic  and  electrodynamic),  we  have 

pq*.— Pot[&„-^=.  j.  (3) 

eta, 


or  by  (2) 


} 


eta 

It  will  be  convenient  to  represent  these  relations  by  a  vector 
notation.  If  we  represent  the  displacement  by  XT,  and  the  electro- 
motive force  by  B,  the  three  equations  of  (3)  will  be  represented  by 
the  single  vector  equation 

WU-  -Pot  [U]A„- VfeU  (5) 

and  the  three  equations  of  (4)  by  the  single  vector  equation 


WU-^?  **[*]*.-*[*!*.. 


(6) 


where,  in  accordance  with  quaternionic  usage,  V[g]ATt  represents  the 
vector  which  has  for  components  the  derivatives  of  [q]x^  with  respect 
to  rectangular  coordinates.  The  symbol  Pot  in  such  a  vector  equation 
signifies  that  the  operatiQn  which  is  denoted  by  this  symbol  in  a 
scalar  equation  is  to  be  performed  upon  each  of  the  components  of 
the  vector. 

7.  We  may  here  observe  that  if  we  are  not  satisfied  with  the  law 
adopted  for  the  determination  of  electrodynamic  force  we  have  only 
to  substitute  for  —Pot  in  these  vector  equations,  and  in  those  which 
follow,  the  symbol  for  the  operation,  whatever  it  may  be,  by  which 
we  calculate  the  electrodynamic  force  from  the  acceleration*  For 
the  operation  must  be  of  such  a  character  that  if  the  acceleration 
consist  of  any  number  of  parts,  the  force  due  to  the  whole  acceleration 
will  be  the  resultant  of  the  forces  due  to  the  separate  parts.  It  will 
evidently  make  no  difference  whether  we  take  an  average  before  or 
after  such  an  operation. 

*  The  tame  would  not  be  true  of  the  corresponding  scalar  equations,  (3)  and  (4).  For 
one  component  of  the  foroe  might  depend  upon  all  the  components  of  acceleration. 
8uoh  is  in  fact  the  case  with  the  law  of  electromotive  force  proposed  by  Weber. 
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8.  Let  us  now  examine  the  relation  which  subsists  between  the 
values  of  [E]Aye  and  [V]krt  for  the  same  point,  that  is,  between  the 
average  electromotive  force  and  the  average  displacement  in  a  small 
sphere  with  its  center  at  the  point  considered  We  have  already  seen 
that  the  forces  and  the  displacements  are  harmonic  functions  of  the 
time  having  a  common  period. 

A  little  consideration  will  show  that  if  the  average  electromotive 
force  in  the  sphere  is  given  as  a  function  of  the  time,  the  displace- 
ments in  the  sphere,  both  average  and  actual,  must  be  entirely 
determined.  Especially  will  this  be  evident,  if  we  consider  that  siuce 
we  have  made  the  radius  of  the  sphere  very  small  in  comparison  with 
a  wave-length,  the  average  force  must  have  sensibly  the  same  value 
throughout  the  sphere  (that  is,  if  we  vary  the  position  of  the  center 
of  the  sphere  for  which  the  average  is  taken  by*  a  distance  not  greater 
than  the  radius,  the  value  of  the  average  will  not  be  sensibly  affected), 
and  that  the  difference  of  the  actual  and  average  force  at  any  point 
is  entirely  determined  by  the  motions  in  the  immediate  vicinity  of 
that  point  If,  then,  certain  oscillatory  motions  may  be  kept  up  in 
the  sphere  under  the  influence  of  electrostatic  and  electrodynamic 
forces  due  to  the  motion  in  the  whole  field,  and  if  we  suppose  the 
motions  in  and  very  near  that  sphere  to  be  unchanged,  but  the  motions 
in  the  remoter  parts  of  the  field  to  be  altered,  only  not  so  as  to  affect 
the  average  resultant  of  electromotive  force  in  the  sphere,  the  actual 
resultant  of  electromotive  force  will  also  be  unchanged  throughout 
the  sphere,  and  therefore  the  motions  in  the  sphere  will  still  be  such 
as  correspond  to  the  forces. 

Now  the  average  displacement  is  a  harmonic  function  of  the  time 
having  a  period  which  we  suppose  given.  It  is  therefore  entirely 
determined  for  the  whole  time  the  vibrations  continue  by  the  values 
of  the  six  quantities 

[£Ut«,       Mat«>       [&».       [£Ur.,       Mat.,       [£|at. 

at  any  one  instant.  For  the  same  reason  the  average  electromotive 
force  is  entirely  determined  for  the  whole  time  by  the  values  of  the 
six  quantities 

[X]*..    mA*e,    [ZU.    EX]ATe,    [tl^    [Z]ATt 

for  the  same  instant.  The  first  six  quantities  will  therefore  be 
functions  of  the  second,  and  the  principle  of  the  superposition  of 
motions  requires  that  they  shall  be  homogeneous  functions  of  the 
first  degree.  And  the  second  six  quantities  will  be  homogeneous 
functions  of  the  first  degree  of  the  first  six.  The  coefficients  by  which 
these  functions  are  expressed  will  depend  upon  the  nature  of  the 
medium  in  the  vicinity  of  the  point  considered.     They  will  also 
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depend  upon  the  period  of  vibration,  that  is,  upon  ine  color  of  the 
light* 

We  may  therefore  write  in  vector  notation 

[B^-^tU^+^tU^  (7) 

where  $  and  >P  denote  linear  functions,  t 

The  optical  properties  of  media  are  determined  by  the  form  of 
these  functions.  But  all  forms  of  linear  functions  would  not  be 
consistent  with  the  principle  of  the  conservation  of  energy. 

In  media  which  are  more  or  less  opaque,  and  which  therefore 
absorb  energy,  ¥  must  be  of  such  a  form  that  the  function  always 
makes  an  acute  angle  (or  none)  with  the  independent  variable.  In 
perfectly  transparent  media,  ¥  must  vanish,  unless  the  function  is 
at  right  angles  to  the  independent  variable.  So  far  as  is  known,  the 
last  occurs  only  when  the  medium  is  subject  to  magnetic  influence. 
In  perfectly  transparent  media,  the  principle  of  the  conservation  of 
energy  requires  that  $  should  be  self-conjugate,  i.e.;  that  for  three 
directions  at  right  angles  to  one  another,  the  function  and  independent 
variable  should  coincide  in  direction. 

In  all  isotropic  media  not  subject  to  magnetic  influence,  it  is  prob- 
able that  $  and  ¥  reduce  to  numerical  coefficients,  as  is  certainly 
the  case  with  $  for  transparent  isotropic  media. 

9.  Comparing  the  two  values  of  [B]AT#,  we  have 

Y  **Pn4-- vMaw-*EO]*.+*[*]*..  (8) 

This  equation,  in  connection  with  that  by  which  we  express  the  sole- 
noidal  character  of  the  displacements,  if  we  regard  them  as  necessarily 
solenoidal,  or  in  connection  with  that  which  expresses  the  relation 
between  the  electrostatic  potential  and  the  displacements,  if  we  reject 
the  solenoidal  hypothesis,  may  be  regarded  as  the  general  equation  of 
the  vibrations  of  monochromatic  light,  considered  as  oscillating  elec- 
trical fluxes.  For  the  symbol  Pot,  however,  we  must  substitute  the 
symbol  representing  the  operation  by  which  electromotive  force  is 
calculated  from  acceleration  of  flux,  with  the  negative  sign,  if  we  are 
not  satisfied  with  the  law  provisionally  adopted. 

*  The  relations  between  the  displacement*  in  one  of  the  small  spaoes  considered  and 
the  average  electromotive  force  is  mathematically  analogous  to  the  relation  between  the 
displacements  in  a  system  of  a  high  degree  of  complexity  and  certain  forces  exerted 
from  without,  whioh  are  harmonic  fonotions  of  the  time  and  under  the  influence  of 
whioh  the  system  vibrates.  The  ratio  of  the  displacements  to  the  forces  will  in  general 
vary  with  the  period,  and  may  vary  very  rapidly. 

An  example  in  which  these  functions  vary  very  rapidly  with  the  period  is  afforded 
by  the  phenomena  of  selective  absorption  and  abnormal  dispersion. 

t  A  veotor  is  said  to  be  a  linear  function  of  another,  when  the  three  components  of  the 
first  are  homogeneous  functions  of  the  first  degree  of  the  three  components  of  the  second. 


Digitized  by  VjOOQIC 


218       EQUATIONS  OF  MONOCHROMATIC  LIGHT  IN  MEDIA 

It  is  important  to  observe  that  the  existence  of  molecular  vibrations 
of  ponderable  matter,  due  to  the  passage  of  light  through  the  medium, 
will  not  affect  the  reasoning  by  which  this  equation  has  been  estab- 
lished, provided  that  the  nature  and  intensity  of  these  vibrations  in 
any  small  part  of  the  medium  (as  measured  by  a  wave-length)  are 
entirely  determined  by  the  electrical  forces  and  motions  in  that  part 
of  the  medium.  But  the  equation  would  not  hold  in  case  of  molecular 
vibrations  due  to  magnetic  force.  Such  vibrations  would  constitute 
an  oscillating  magnetization  of  the  medium,  which  has  already  been 
excluded  from  the  discussion. 

The  supposition  which  has  sometimes  been  made*  that  electricity 
possesses  a  certain  mass  or  inertia,  would  not  at  all  affect  the  validity 
of  the  equation. 

10.  The  equation  may  be  reduced  to  a  form  in  some  respects  more 
simple  by  the  use  of  the  so-called  imaginary  quantities.  We  shall 
write  i  for  V(~'1)-    H  we  differentiate  with  respect  to  the  time,  and 

47T* 

substitute  — ;j-j-[XJ]a**  for  [TJ]A?t,  we  obtain 

If  we  multiply  this  equation  by  i,  either  alone  or  in  connection  with 
any  real  factor,  and  add  it  to  the  preceding,  we  shall  obtain  an 
equation  which  will  be  equivalent  to  the  two  of  which  it  is  formed. 

Multiplying  by  —  |-  and  adding,  we  have 

If  we  set  W-prU-.£tO]4„.  (9) 

e=*+«^*,  (ii) 

our  equation  reduces  to 

*£PotW-VQ  =  9W.  (12) 

In  this  equation  9  denotes  a  complex  linear  vector  function,  i.e.,  a 
vector  function  of  which  the  X-,  Y-,  and  Z-components  are  expressed 
in  terms  of  the  X-,  Y-,  and  Z-components  of  the  independent  variable 
by  means  of  coefficients  of  the  form  a+ib.    W  is  a  bi vector  of  which 


•See  Weber,  Abhandl.  d:  K.  Sdchs.  QesdUch.  d.  Wits.,  vol  vi,  pp.  693-597;  Lorberg, 
OrdU'B  Journal,  voL  bri,  p.  55. 
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the  real  part  represents  the  averaged  displacement  [U]ATtl  and  the 
coefficient  of  i  the  rate  of  increase  of  the  same  multiplied  by  a 
constant  factor.  This  bivector  therefore  represents  the  average 
state  of  a  small  part  of  the  field  both  with  respect  to  position  and 
velocity.  We  may  also  say  that  the  coefficient  of  «  in  W  represents 
the  value  of  the  averaged  displacement  [U]^  at  a  time  one-quarter 
of  a  vibration  earlier  than  the  time  principally  considered. 

11.  It  may  serve  to  fix  our  ideas  to  see  how  W  is  expressed  as  a 
function  of  the  time.     We  may  evidently  set 

[U^-^cosye+AjSinye 

where  A^  and  A,  are  vectors  representing  the  amplitudes  of  the  two 
parts  into  which  the  vibration  is  resolved.    Then 

£{*]*•-  -Ajsinyj+Ajcosy*, 
and 

[U]AW-'^[nU=(A1-iA^(cos^+i8in^); 

that  is,  if  we  set  A=A1  — « A,, 

W=Ae*  (13) 

In  like  manner  we  may  obtain 

Q=--0«\  (H) 

where  g  is  a  biscalar,  or  complex  quantity  of  ordinary  algebra. 
Substituting  these  values  in  (12),  and  cancelling  the  common  factor 
containing  the  time,  we  have 

^PotA-V^GA.  (15) 

Our  equation  is  thus  reduced  to  one  between  A  and  g,  and  may  easily 
be  reduced  to  one  in  A  alone.*  Now  A  represents  six  numerical 
quantities  (viz.,  the  three  components  of  Aj,  and  the  three  of  Ag), 
which  may  be  called  the  six  components  of  amplitude.  The  equation, 
therefore,  substantially  represents  the  relations  between  the  six  com- 
ponents of  amplitude  in  different  parts  of  the  field. t  The  equation 
is,  however,  not  really  different  from  (12),  since  A  and  g  are  only 
particular  values  of  W  and  Q. 

*The  terms  VQ,  T?q  are  allowed  to  remain  in  these  equations,  because  the  best 
manner  of  eliminating  them  will  depend  somewhat  upon  our  admission  or  rejection  of 
the  solenoidal  hypothesis. 

t  The  representation  of  the  six  components  of  amplitude  by  a  single  letter  should  not 
be  regarded  as  an  analytical  artifice.     It  only  leaves  undivided  in  our  notation  that 
which  is  undivided  in  the  nature  of  things.     The  separation  of  the  six  components  of 
amplitude  is  artificial,  in  that  it  introduces  arbitrary  elements  into  the  discussion,  viz. 
the  directions  of  the  axes  of  the  coordinates,  and  the  zero  of  time. 
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12.  From  the  general  equation  given  above  (8,  12,  or  15),  in  con- 
nection with  the  solenoidal  hypothesis,  we  may  easily  derive  the  laws 
of  the  propagation  of  plane  waves  in  the  interior  of  a  sensibly  homo- 
geneous medium,  and  the  laws  of  reflection  and  refraction  at  surfaces 
between  such  media.  This  has  been  done  by  Maxwell,*  Lorentz,t  and 
others,}  with  fundamental  equations  more  or  less  similar. 

The  method,  however,  by  which  the  fundamental  equation  has 
been  established  in  this  paper  seems  free  from  certain  objections 
which  have  been  brought  against  the  ordinary  form  of  the  theory. 
As  ordinarily  treated,  the  phenomena  are  made  to  depend  entirely 
on  the  inductive  capacity  and  the  conductivity  of  the  medium,  in  a 
manner  which  may  be  expressed  by  the  equation 

ro--(MS$(F^^-vw4       <16> 

which  will  be  equivalent  to  (12),  if 

where  E  and  C  denote  in  the  most  general  case  the  linear  vector 
functions,  but  in  isotropic  bodies  the  numerical  coefficients,  which 
represent  inductive  capacity  and  conductivity.  By  a  simple  trans- 
formation {see  (9)  and  (10)},  this  equation  becomes 

«--£-£  <18> 

where  0"1  represents  the  function  inverse  to  0. 

Now,  while  experiment  appears  to  verify  the  existence  of  such  a 
law  as  is  expressed  by  equation  (12),  it  does  not  show  that  9  has  the 
precise  form  indicated  by  equation  (16).  In  other  words,  experiment 
does  not  satisfactorily  verify  the  relations  expressed  by  (16)  and  (17), 
if  K  and  C  are  understood  to  be  the  operators  (or,  in  isotropic  bodies, 
the  numbers)  which  represent  inductive  capacity  and  conductivity  in 
the  ordinary  sense  of  the  terms. 

The  discrepancy  is  most  easily  shown  in  the  most  simple  case,  when 
the  medium  is  isotropic  and  perfectly  transparent,  and  0  reduces  to  a 
numerical  quantity.    The  square  of  the  velocity  of  plane  waves  is 

then  equal  to  j-,  and  equation  (18)  would  make  it  independent  of  the 


*Phil.  Trans.,  voL  olv  (1865),  p.  459,  or  Treatise  on  Electricity  and  Magnetism, 
chap.  xx. 

tSehlOmilch's  ZeiXschrifi,  vol  xxii,  pp.  1-30  and  206-219 ;  xxiii,  pp.  197-210. 

J  See  Fitzgerald,  Phil.  Trans. ,  vol.  clxxi,  p.  691;  J.  J.  Thomson,  Phil.  Mag., 
(5),  vol  ix,  p.  284;  Rayleigh,  Phil.  Mag.  (5),  voL  xii,  p.  81. 

That  the  electromagnetic  theory  of  light  gives  the  conditions  relative  to  the  boundary 
of  different  media,  which  are  required  by  the  phenomena  of  reflection  and  refraction, 
was  first  shown  by  Helmholtz.     See  Crelle's  Journal,  vol.  lxxii  (1870),  p.  57. 
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period;  that  is,  would  give  no  dispersion  of  colors.  The  ease  is 
essentially  the  same  in  transparent  bodies  which  are  not  isotropic* 

The  case  is  worse  with  metals,  which  are  characterized  electrically 
by  great  conductivity,  and  optically  by  great  opacity.  In  their 
papers  cited  above,  Lorentz  and  Rayleigh  have  observed  that  the 
experiments  of  Jamin  on  the  reflection  of  light  from  metallic  surfaces 
would  often  require,  as  ordinarily  interpreted  on  the  electromagnetic 
theory,  a  negative  value  for  the  inductive  capacity  of  the  metal. 
This  would  imply  that  the  electrical  equilibrium  in  the  metal  is 
unstable.  The  objection,  therefore,  is  essentially  the  same  as  that 
which  Lord  Rayleigh  had  previously  made  to  Cauchy's  theory  of 
metallic  reflection,  viz.,  that  the  apparent  mechanical  explanation 
of  the  phenomena  is  illusory,  since  the  numerical  values  given  by 
experiment  as  interpreted  on  Cauchy's  theory  would  involve  an 
unstable  equilibrium  of  the  ether  in  the  metal,  t 

13.  All  this  points  to  the  same  conclusion — that  the  ordinary  view 
of  the  phenomena  is  inadequate.  The  object  of  this  paper  will  be 
accomplished,  if  it  has  been  made  clear  how  a  point  of  view  more  in 
accordance  with  what  we  know  of  the  molecular  constitution  of 
bodies  will  give  that  part  of  the  ordinary  theory  which  is  verified  by 
experiment,  without  including  that  part  which  is  in  opposition  to 
observed  facta  J 

*  See  note  to  the  first  paper  of  Lorentz,  cited  above,  Schldmiloh,  vol.  zxii,  p.  23. 

1 8ee  Phil.  Mag.  (4),  vol  xliii  (1872),  p.  321. 

$The  consideration  of  the  processes  which  we  may  suppose  to  take  place  in  the 
smallest  parts  of  a  body  through  which  light  is  transmitted,  farther  than  is  neoessary  to 
establish  the  general  equation  given  above,  is  foreign  to  the  design  of  this  paper.  Yet  a 
word  may  be  added  with  respect  to  the  difficulties  signalized  in  the  ordinary  form  of  the 
theory.  The  comparatively  simple  case  of  a  perfectly  transparent  body  has  been 
examined  more  in  detail  in  one  of  the  papers  already  cited,  where  there  is  given  an 
explanation  of  the  dispersion  of  colors  from  the  point  of  view  of  this  paper.  It  is  there 
shown  that  the  effect  of  the  non-homogeneity  of  the  body  in  its  smallest  parts  is  to  add 
a  term  to  the  expression  for  the  kinetic  energy  of  electrical  waves,  which  for  an  isotropic* 
body  may  be  roughly  described  as  similar  to  that  which  would  be  required  if  the 
electricity  had  a  certain  mass  or  inertia.  (See  especially  §§  7,  9  and  12,  [this  volume 
pages  185  ft])  The  same  must  be  true  of  media  of  any  degree  of  opacity.  Now  the 
difficulty  with  the  optical  properties  of  the  metals  is  that  the  real  part  of  G  (or  9"1) 
is  in  some  oases  negative.  This  implies  that  at  a  moment  of  greatest  displacement 
the  electromotive  force  is  in  the  direction  opposite  to  the  displacement,  instead  of 
having  the  same  direction,  as  in  transparent  isotropic  bodies.  Now  a  oertain  part  of 
the  electromotive  foroe  must  be  required  to  oppose  the  apparent  inertia,  and  another 
part  to  oppose  the  electrical  elasticity  of  the  medium.  These  parts  of  the  foroe  must 
have  opposite  directions.  In  transparent  bodies  the  latter  part  is  by  far  the  greater.  But 
it  need  not  surprise  us  that  the  former  should  be  the  greater  in  some  metals. 

It  has  been  remarked  by  Lorentz  that  the  difficulty  with  respqot  to  metals  would  be 
in  a  measure  relieved  if  we  should  suppose  electricity  to  have  the  property  of  inertia. 
(See  §  11  of  his  third  paper,  Schldmiloh's  Zeilschrifl,  vol.  xxiii,  p.  208.)  But  a  supposi- 
tion of  this  kind,  taken  literally,  would  involve  a  dispersion  of  colors  in  vacuo,  and  still 
be  inadequate,  as  Lorentz  remarks,  *x>  explain  the  phenomena  observed  in  metals. 
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While  the  writer  has  aimed  at  a  greater  degree  of  rigor  than  is 
usual  in  the  establishment  of  the  fundamental  equation  of  mono- 
chromatic light,  it  is  not  claimed  that  this  equation  is  absolutely 
exact.  The  contrary  is  evident  from  the  fact  that  the  equation 
does  not  embrace  the  phenomena  which  characterize  such  circularly 
polarizing  bodies  as  quartz.  This,  however,  only  implies  the  neglect 
of  extremely  small  quantities — very  small,  for  example,  as  compared 
with  those  which  determine  the  dispersion  of  colors.  In  one  of  the 
papers  already  cited*  the  case  of  a  perfectly  transparent  body  is 
treated  with  a  higher  degree  of  approximation,  so  as  to  embrace  the 
phenomena  in  question. 


*  See  page  195  of  this  volume. 
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XIV. 


A  COMPARISON  OF  THE  ELASTIC  AND  THE  ELECTRICAL 
THEORIES  OF  LIGHT  WITH  RESPECT  TO  THE  LAW  OF 
DOUBLE  REFRACTION  AND  THE  DISPERSION  OF  COLORS. 

[Jmeriean  Journal  of  Science,  ser.  3,  vol  xxxv,  pp.  467-476,  June,  1888.] 

It  is  claimed  for  the  electrical  *  theory  of  light  that  it  is  free  from 
serious  difficulties,  which  beset  the  explanation  of  the  phenomena  of 
light  by  the  dynamics  of  elastic  solids.  Just  what  these  difficulties 
are,  and  why  they  do  not  occur  in  the  explanation  of  the  same 
phenomena  by  the  dynamics  of  electricity,  has  not  perhaps  been 
shown  with  all  the  simplicity  and  generality  which  might  be  desired. 
Such  a  treatment  of  the  subject  is  however  the  more  necessary  on 
account  of  the  ever-increasing  bulk  of  the  literature  on  either  side, 
and  the  confusing  multiplicity  of  the  elastic  theories.  It  is  the  object 
of  this  paper  to  supply  this  want,  so  far  as  respects  the  propagation 
of  plane  waves  in  transparent  and  sensibly  homogeneous  media.  The 
simplicity  of  this  part  of  the  subject  renders  it  appropriate  for  the 
first  test  of  any  optical  theory,  while  the  precision  of  which  the 
experimental  determinations  are  capable,  renders  the  test  extremely 
rigorous. 

It  is  moreover,  as  the  writer  believes,  an  appropriate  time  for  the 
discussion  proposed,  since  on  one  hand  the  experimental  verification 
of  Fresnel's  Law  has  recently  been  carried  to  a  degree  of  precision  far 
exceeding  anything  which  we  have  had  before,t  and  on  the  other,  the 


*  The  term  tlectrical  seems  the  most  simple  and  appropriate  to  desoribe  that  theory  of 
light  which  makes  it  oonsist  in  electrical  motions.  The  oases  in  which  any  distinctively 
magnetio  action  is  involved  in  the  phenomena  of  light  are  so  exceptional,  that  it  is 
difficult  to  see  any  sufficient  reason  why  the  general  theory  should  be  called  electro- 
magnetic, unless  we  are  to  oall  all  phenomena  electromagnetic  which  depend  on  the 
motions  of  eleotrioity. 

t  In  the  recent  experiments  of  Professor  Hastings  relating  to  the  index  of  refraction  of 
the  extraordinary  ray  in  Iceland  spar  for  the  spectral  line  D,  and  a  wave-normal  inclined 
at  about  31°  to  the  optio  axis,  the  difierenoe  between  the  observed  and  the  calculated 
values  was  only  two  or  three  units  in  the  sixth  decimal  plaoe  (in  the  seventh  significant 
figure),  whioh  was  about  the  probable  error  of  the  determinations.  See  Am.  Jour.  8cL 
ser.  3,  vol  xxxv,  p.  00. 
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discovery  of  a  remarkable  theorem  relating  to  the  vibrations  of  a 
strained  solid  *  has  given  a  new  impulse  to  the  study  of  the  elastic 
theory  of  light. 

Let  us  first  consider  the  facts  to  which  a  correct  theory  must 
conform. 

It  is  generally  admitted  that  the  phenomena  of  light  consist 
in  motions  (of  the  type  which  we  call  wave-motions)  of  some- 
thing which  exists  both  in  space  void  of  ponderable  matter,  and 
in  the  spaces  between  the  molecules  of  bodies,  perhaps  also  in 
the  molecules  themselves.  The  kinematics  of  these  motions  is 
pretty  well  understood;  the  question  at  issue  is  whether  it  agrees 
with  the  dynamics  of  elastic  solids  or  with  the  dynamics  of 
electricity. 

In  the  case  of  a  simple  harmonic  wave-motion,  which  alone  we  need 
consider,  the  wave-velocity  (V)  is  the  quotient  of  the  wave-length  (I) 
by  the  period  of  vibration  (p).  These  quantities  can  be  determined 
with  extreme  accuracy.  In  media  which  are  sensibly  homogeneous 
but  not  isotropic  the  wave-velocity  V,  for  any  constant  value  of  the 
period,  is  a  quadratic  function  of  the  direction  cosines  of  a  certain 
line,  viz.,  the  normal  to  the  so-called  "plane  of  polarization."  The 
physical  characteristics  of  this  line  have  been  a  matter  of  dispute. 
Fresnel  considered  it  to  be  the  direction  of  displacement  Others 
have  maintained  that  it  is  the  common  perpendicular  to  the  wave- 
normal  and  the  displacement.  Others  again  would  define  it  as 
that  component  of  the  displacement  which  is  perpendicular  to  the 
wave-normal.  This  of  course  would  differ  from  Fresnel's  view  only 
in  case  the  displacements  are  not  perpendicular  to  the  wave-normal, 
and  would  in  that  case  be  a  necessary  modification  of  his  view. 
Although  this  dispute  has  been  one  of  the  most  celebrated  in 
physics,  it  seems  to  be  at  length  substantially  settled,  most  directly 
by  experiments  upon  the  scattering  of  light  by  small  particles, 
which  seems  to  show  decisively  that  in  isotropic  media  at  least 
the  displacements  are  normal  to  the  "plane  of  polarization,"  and 
also,  with  hardly  less  cogency,  by  the  difficulty  of  accounting 
for  the  intensities  of  reflected  and  refracted  light  on  any  other 


*  Sir  Wm.  Thomson  has  shown  that  if  an  elastio  incompressible  solid  in  which  the 
potential  energy  of  any  homogeneous  strain  is  proportional  to  the  sum  of  the  squares  of 
the  reciprocals  of  the  principal  elongations  minus  three  is  subjected  to  any  homogeneous 
strain  by  forces  applied  to  its  surface,  the  transmission  of  plane  waves  of  distortion, 
superposed  on  this  homogeneous  strain,  will  follow  exactly  Fresnel's  law  (including  the 
direction  of  displacement),  the  three  prinoipal  velocities  being  proportional  to  the 
reciprocals  of  the  principal  elongations.  It  must  be  a  surprise  to  mathematicians  and 
physicists  to  learn  that  a  theorem  of  such  simplicity  and  beauty  has  been  waiting  to  be 
discovered  in  a  field  which  has  been  so  carefully  gleaned.  See  page  116  of  the  current 
volume  (xxv)  of  the  Philosophical  Magazine, 
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supposition.*  It  should  be  added  that  all  diversity  of  opinion  on 
this  subject  has  been  confined  to  those  whose  theories  are  based  on 
the  dynamics  of  elastic  bodies.  Defenders  of  the  electrical  theory 
have  always  placed  the  electrical  displacement  at  right  angles  to  the 
"plane  of  polarization."  It  will,  however,  be  better  to  assume  this 
direction  of  the  displacement  as  probable  rather  than  as  absolutely 
certain,  not  so  much  because  many  are  likely  to  entertain  serious 
doubts  on  the  subject,  as  in  order  not  to  exclude  views  which  have 
at  least  a  historical  interest 

The  wave-velocity,  then,  for  any  constant  period,  is  a  quadratic 
function  of  the  cosines  of  a  certain  direction,  which  is  probably  that 
of  the  displacement,  but  in  any  case  determined  by  the  displacement 
and  the  wave-normal.  The  coefficients  of  this  quadratic  function  are 
functions  of  the  period  of  vibration.  It  is  important  to  notice  that 
these  coefficients  vary  separately,  and  often  quite  differently,  with  the 
period,  and  that  the  case  does  not  at  all  resemble  that  of  a  quadratic 
function  of  the  direction-cosines  multiplied  by  a  quantity  depending 
on  the  period. 

In  discussing  the  dynamics  of  the  subject  we  may  gain  something 
in  simplicity  by  considering  a  system  of  stationary  waves,  such  as 
results  from  two  similar  systems  of  progressive  waves  moving  in 
opposite  directions.  In  such  a  system  the  energy  is  alternately 
entirely  kinetic  and  entirely  potential.  Since  the  total  energy  is 
constant,  we  may  set  the  average  kinetic  energy  per  unit  of  volume 
at  the  moment  when  there  is  no  potential  energy,  equal  to  the  average 
potential  energy  per  unit  of  volume  when  there  is  no  kinetic  energy.! 
We  may  call  this  the  equation  of  energies.  It  will  contain  the 
quantities  I  and  p,  and  thus  furnish  an  expression  for  the  velocity 
of  either  system  of  progressive  waves.  We  have  to  see  whether  the 
elastic  or  the  electric  theory  gives  the  expression  most  conformed  to 
the  facta 

Let  us  first  apply  the  elastic  theory  to  the  case  of  the  so-called 

*  "  At  the  same  time,  if  the  above  reasoning  be  valid,  the  question  as  to  the  direction 
of  the  vibrations  in  polarized  light  is  decided  in  aooordanoe  with  the  view  of  FresneL 
...  I  oonfess  I  cannot  see  any  room  for  doubt  as  to  the  result  it  leads  to.  .  .  .  I  only 
mean  that  if  light,  as  is  generally  supposed,  oonsists  of  transversal  vibrations  similar  to 
those  whioh  take  place  in  an  elastic  solid,  the  vibration  must  be  normal  to  the  plane  of 
polarization. "  Lord  Rayleigh  "  On  the  Light  from  the  Sky,  its  Polarization  and  Color;" 
Phil  Mag.  (4),  xli  (1871),  p.  109. 

"  Green's  dynamios  of  polarization  by  reflexion,  and  Stokes*  dynamics  of  the  diffraotion 
of  polarized  light,  and  Stokes'  and  Rayleigh's  dynamios  of  the  blue  sky,  all  agree  in,  as 
it  seems  to  me,  irrefragably,  demonstrating  Fresnel's  original  conclusion,  that  in  plane 
polarized  light  the  line  of  vibration  is  perpendicular  to  the  plane  of  polarization." 
Sir  Wm.  Thomson,  loc.  dial. 

fThe  terms  kinetic  energy  and  potential  energy  will  be  used  in  this  paper  to  denote 
these  average  values. 
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vacuum.  If  we  write  h  for  the  amplitude  measured  in  the  middle 
between  two  nodal  planes,  the  velocities  of  displacement  will  be  as 

- ,  and  the  kinetic  energy  will  be  represented  by  A -5 ,  where  A  is  a 

constant  depending  on  the  density  of  the  medium.  The  potential 
energy,  which  consists  in  distortion  of  the  medium,  may  be  represented 

by  B^j,  where  B  is  a  constant  depending  on  the  rigidity  of  the 

medium.    The  equation  of  energies,  on  the  elastic  theory,  is  therefore 

A-*=B?  (1) 

which  gives  V*«5-|=--.  (2) 

In  the  electrical  theory,  the  kinetic  energy  is  not  determined  by  the 

simple  formula  of  ordinary  dynamics  from  the  square  of  the  velocity 

of  each  element,  but  is  found  by  integrating  the  product  of  the 

velocities  of  each  pair  of  elements  divided  by  the  distance  between 

them.    Very  elementary  considerations  suffice  to  show  that  a  quantity 

thus  determined  when  estimated  per  unit  of  volume  will  vary  as  the 

h* 
square  of  the  wave-length.    We  may  therefore  set  FP;3  for  the  kinetic 

energy,  F  being  a  constant.  The  potential  energy  does  not  consist 
in  distortion  of  the  medium,  but  depends^  upon  an  elastic  resistance 
to  the  separation  of  the  electricities,  which  constitutes  the  electrical 
displacement,  and  is  proportioned  to  the  square  of  this  displacement. 
The  average  value  of  the  potential  energy  per  unit  of  volume  will 
therefore  be  represented  in  the  electrical  theory  by  GAf ,  where  Q  is  a 
constant,  and  the  equation  of  energies  will  be 

W*J-GA«  (3) 

which  rives 

P     Q 

Both  theories  give  a  constant  velocity,  as  is  required.  But  it  is 
instructive  to  notice  the  profound  difference  in  the  equations  of  energy 
from  which  this  result  is  derived.  In  the  elastic  theory  the  square  of 
the  wave-length  appears  in  the  potential  energy  as  a  divisor;  in  the 
electrical  theory  it  appears  in  the  kinetic  energy  as  a  factor. 

Let  us  now  consider  how  these  equations  will  be  modified  by  the 
presence  of  ponderable  matter,  in  the  most  general  case  of  transparent 
and  sensibly  homogeneous  bodies.  This  subject  is  rendered  much 
more  simple  by  the  fact  that  the  distances  between  the  ponderable 
molecules  are  very  small  compared  with  a  wave-length.    Or,  what 
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amounts  to  the  same  thing,  bat  may  present  a  more  distinct  picture 
to  the  imagination,  the  wave-length  may  be  regarded  as  enormously 
great  in  comparison  with  the  distances  between  neighboring  molecules. 
Whatever  view  we  take  of  the  motions  which  constitute  light,  we  can 
hardly  suppose  them  (disturbed  as  they  are  by  the  presence  of  the 
ponderable  molecules)  to  be  in  strictness  represented  by  the  equations 
of  wave-motion.  Yet  in  a  certain  sense  a  wave-motion  may  and  does 
exist  If,  namely,  instead  of  the  actual  displacement  at  any  point, 
we  consider  the  average  displacement  in  a  space  large  enough  to 
contain  an  immense  number  of  molecules,  and  yet  small  as  measured 
by  a  wave-length,  such  average  displacements  may  be  represented 
by  the  equations  of  wave-motion;  and  it  is  only  in  this  sense  that 
any  theory  of  wave-motion  can  apply  to  the  phenomena  of  light  in 
transparent  bodies.  When  we  speak  of  displacements,  amplitudes, 
velocities  (of  displacement),  eta,  it  must  therefore  be  understood  in 
this  way. 

The  actual  kinetic  energy,  on  either  theory,  will  evidently  be 
greater  than  that  due  to  the  motion  thus  averaged  or  smoothed,  and 
to  a  degree  presumably  depending  on  the  direction  of  the  displace- 
ment But  since  displacement  in  any  direction  may  be  regarded  as 
compounded  of  displacements  in  three  fixed  directions,  the  additional 
energy  will  be  a  quadratic  function  of  the  components  of  velocity  of 
displacement,  or,  in  other  words,  a  quadratic  function  of  the  direction- 
cosines  of  the  displacement  multiplied  by  the  square  of  the  amplitude 
and  divided  by  the  square  of  the  period.*  This  additional  energy 
may  be  understood  as  including  any  part  of  the  kinetic  energy  of 
the  wave-motion  which  may  belong  to  the  ponderable  particles.  The 
term  to  be  added  to  the  kinetic  energy  on  the  electric  theory  may 

h% 
therefore  be  written  /d£s»  where  /D  is  a  quadratic  function  of  the 

direction-cosines  of  the  displacement  The  elastic  theory  requires  a 
term  of  precisely  the  same  character,  but  since  the  term  to  which  it 
is  to  be  added  is  of  the  same  general  form,  the  two  may  be  incor- 
porated in  a  single  term  of  the  form  AD-g,  where  AD  is  a  quadratic 

function  of  the  direction-cosines  of  the  displacement  We  must, 
however,  notice  that  both  Ad  and  /D  are  not  entirely  independent  of 
the  period.  For  the  manner  in  which  the  flux  of  the  luminiferous 
medium  is  distributed  among  the  ponderable  molecules  will  naturally 
depend  somewhat  upon  the  period.  The  same  is  true  of  the  degree 
to  which  the  molecules  may  be  thrown  into  vibration.  But  Aj>  and 
/D  will  be  independent  of  the  wave-length  (except  so  far  as  this  is 

*  For  proof  t»  extesso  off  this  proposition,  when  the  motions  ere  supposed  eleotriosl, 
the  reader  is  referred  to  page  187  of  this  volume. 
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connected  with  the  period),  because  the  wave-length  is  enormously 
great  compared  with  the  size  of  the  molecules  and  the  distances 
between  them. 

The  potential  energy  on  the  elastic  theory  must  be  increased  by 
a  term  of  the  form  b^h*,  where  ^  is  a  quadratic  function  of  the 
direction-coednes  of  the  displacement.  For  the  ponderable  particles 
must  oppose  a  certain  elastic  resistance  to  the  displacement  of  the 
ether,  which  in  seolotropic  bodies  will  presumably  be  different  in 
different  directions.  The  potential  energy  on  the  electric  theory  will 
be  represented  by  a  single  term  of  the  same  form,  say  Qj>h*,  where 
a  quadratic  function  of  the  direction-cosines  of  the  displacement,  GD, 
takes  the  place  of  the  constant  Q,  which  was  sufficient  when  the 
ponderable  particles  were  absent.  Both  GD  and  b^  will  vary  to  some 
extent  with  the  period,  like  AD  and  /D,  and  for  the  same  reason. 

In  regard  to  that  potential  energy,  which  on  the  elastic  theory  is 

independent  of  the  direct  action,  of  the  ponderable  molecules,  it  has 

been  supposed  that  in  sBolotropic  bodies  the  effect  of  the  molecules  is 

such  as  to  produce  an  seolotropic  state  in  the  ether,  so  that  the  energy 

of  a  distortion  varies  with  its  orientation.    This  part  of  the  potential 

hs 
energy  will  then  be  represented  by  B^  ij  >  where  Bud  is  a  function  of 

the  directions  of  the  wave-normal  and  the  displacement.  It  may 
easily  be  shown  that  it  is  a  quadratic  function  both  of  the  direction- 
cosines  of  the  wave-normal  and  of  those  of  the  displacement.  Also, 
that  if  the  ether  in  the  body  when  undisturbed  is  not  in  a  state  of 
stress  due  to  forces  at  the  surface  of  the  body,  or  if  its  stress  is 
uniform  in  all  directions,  like  a  hydrostatic  pressure,  the  function  Bjn> 
must  be  symmetrical  with  respect  to  the  two  sets  of  direction-cosines. 
The  equation  of  energies  for  the  elastic  theory  is  therefore 


^"Di?  ~  ^NB  I*  "*"  ^^ ' 


(5) 


P         Bro 


which  gives  V*=-a= .      ">    „■  (6) 

The  equation  of  energies  for  the  electrical  theory  is 

™p+f»?=G»ht>  <7> 

which  gives  V«=!=^-^.  (8) 

It  is  evident  at  once  that  the  electrical  theory  gives  exactly  the 
form  that  we  want.  For  any  constant  period  the  square  of  the 
wave-velocity  is  a  quadratic  function  of  the  direction-cosines  of 
the   displacement.    When   the  period  varies,  this  function  varies, 
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the  different  coefficients  in  the  function  varying  separately,  because 
QD  and  /D  will  not  in  general  be  similar  functions.*  If  we  consider  a 
constant  direction  of  displacement  while  the  period  varies,  Gj>  and 
/d  will  only  vary  so  far  as  the  type  of  the  motion  varies,  Le.,  so  far 
as  the  manner  in  which  the  flux  distributes '  itself  among  the 
ponderable  molecules  and  intermolecular  spaces,  and  the  extent  to 
which  the  molecules  take  part  in  the  motion  are  changed.  There 
are  cases  in  which  these  vary  rapidly  with  the  period,  viz.,  casee 
of  selective  absorption  and  abnormal  dispersion.  But  we  may  fairly 
expect  that  there  will  be  many  cases  in  which  the  character  of  the 
motion    in  these  respects   will  not   vary  much   with  the  period. 

t=t  and  J-£  will  then  be  sensibly  constant  and  we  have  an  approximate 

expression  for  the  general  law  of  dispersion,  which  agrees  remarkably 
well  with  experiment.! 

If  we  now  return  to  the  equation  of  energies  obtained  from  the 
elastic  theory,  we  see  at  once  that  it  does  not  suggest  any  such 
relation  as  experiment  has  indicated,  either  between  the  wave- velocity 
and  the  direction  of  displacement,  or  between  the  wave-velocity  and 
the  period.  It  remains  to  be  seen  whether  it  can  be  brought  to 
agree  with  experiment  by  any  hypothesis  not  too  violent. 

In  order  that  V*  may  be  a  quadratic  function  of  any  set  of 
direction-cosines,  it  is  necessary  that  AD  and  bj>  shall  be  independent 
of  the  direction  of  the  displacement,  in  other  words,  in  the  case  of 
a  crystal  like  Iceland  spar,  that  the  direct  action  of  the  ponderable 
molecules  upon  the  ether,  shall  affect  both  the  kinetic  and  the 
potential  energy  in  the  same  way,  whether  the  displacement  take 
place  in  the  direction  of  the  optic  axis  or  at  right  angles  to  it. 
This  is  contrary  to  everything  which  we  should  expect  If,  never- 
theless, we  make  this  supposition,  it  remains  to  consider  Bjd.  This 
must  be  a  quadratic  function  of  a  certain  direction,  which  is  almost 
certainly  that  of  the  displacement.  If  the  medium  is  free  from 
external  stress  (other  than  hydrostatic),  Bjd,  as  we  have  seen,  is 
symmetrical  with  respect  to  the  wave-normal  and  the  direction 
of  displacement,  and  a  quadratic  function  of  the  direction-cosines 
of  each.  The  only  single  direction  of  which  it  can  be  a  function  is 
the  common  perpendicular  to  these  two  directions.  If  the  wave- 
normal  and  the  displacement  are  perpendicular,  the  direction-cosines 

*  But  G»,/*,  and  V,  considered  as  function!  of  the  direction  of  displacement,  are  all 
subject  to  any  law  of  symmetry  whioh  may  belong  to  the  structure  of  the  body 
considered.  The  resulting  optical  characteristics  of  the  different  orystallographio 
systems  are  given  on  pages  192-194. 

fThis  will  appear  most  distinctly  if  we  consider  that  V  divided  by  the  velocity  of 
light  in  vacuo  gives  the  reciprocal  of  the  index  of  refraction,  and  p  multiplied  by  the 
same  quantity  gives  the  wave-length  in  vacuo. 
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of  the  common  perpendicular  to  both  will  be  linear  functions  of  the 
direction-cosines  of  each,  and  a  quadratic  function  of  the  direction- 
cosines  of  the  common  perpendicular  will  be  a  quadratic  function  of 
the  direction-cosines  of  each.  We  may  thus  reconcile  the  theory 
with  the  law  of  double  refraction,  in  a  certain  sense,  by  supposing 
that  AD  and  6D  are  independent  of  the  direction  of  displacement, 
and  that  B^  and  therefore  V2  is  a  quadratic  function  of  the  direction- 
cosines  of  the  common  perpendicular  to  the  wave-normal  and  the 
displacement.  But  this  supposition,  besides  its  intrinsic  improbability 
so  far  as  AD  and  6©  are  concerned,  involves  a  direction  of  the  dis- 
placement which  is  certainly  or  almost  certainly  wrong. 

We  are  thus  driven  to  suppose  that  the  undisturbed  medium  is  in 
a  state  of  stress,  which,  moreover,  is  not  a  simple  hydraulic  stress. 
In  this  case,  by  attributing  certain  definite  physical  properties  to 
the  medium,  we  may  make  the  function  B^  become  independent  of 
the  direction  of  the  wave-normal,  and  reduce  to  a  quadratic  function 
of  the  direction-cosines  of  the  displacement.*  This  entirely  satisfies 
Fresnel's  Law,  including  the  direction  of  displacement,  if  we  can 
suppose  AD  and  bj>  independent  of  the  direction  of  displacement. 
But  this  supposition,  in  any  case  difficult  for  aeolotropic  bodies, 
seems  quite  irreconcilable  with  that  of  a  permanent  (not  hydrostatic) 
stress. 

For  this  stress  can  only  be  kept  up  by  the  action  of  the  ponderable 
molecules,  and  by  a  sort  of  action  which  hinders  the  passage  of  the 
ether  past  the  molecules.  Now  the  phenomena  of  reflection  and 
refraction  would  be  very  different  from  what  they  are,  if  the  optical 
homogeneity  of  a  crystal  did  not  extend  up  very  close  to  the  surface. 
This  implies  that  the  stress  is  produced  by  the  ponderable  particles 
in  a  very  thin  lamina  at  the  surface  of  the  crystal,  much  less  in 
thickness,  it  would  seem  probable,  than  a  wave-length  of  yellow 
light.  And  this  again  implies  that  the  power  of  the  ponderable 
particles  to  pin  down  the  ether,  as  it  were,  to  a  particular  position  is 
very  great,  and  that  the  term  in  the  energy  relating  to  the  motion 
of  the  ether  relative  to  the  ponderable  particles  is  very  important. 
This  is  the  term  containing  the  factor  fcD,  which  it  is  difficult  to 
suppose  independent  of  the  direction  of  displacement  because  the 
dimensions  and  arrangement  of  the  particles  are  different  in  different 
directions.  But  our  present  hypothesis  has  brought  in  a  new 
reason  for  supposing  6D  to  depend  on  the  direction  of  displacement, 
viz.,  on  account  of  the  stress  of  the  medium.  A  general  displacement 
of  the  medium  midway  between  two  nodal  planes,  when  it  is 
restrained  at  innumerable  points  by  the  ponderable  particles,  will 


*  See  note  on  page  224. 
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produce  special  distortions  due  to  these  particles.  The  nature  of 
these  distortions  is  wholly  determined  by  the  direction  of  displace- 
ment, and  it  is  hard  to  conceive  of  any  reason  why  the  energy  of 
these  distortions  should  not  vary  with  the  direction  of  displacement, 
like  the  energy  of  the  general  distortion  of  the  wave-motion,  which 
is  partly  determined  by  the  displacement  and  partly  by  the  wave- 
normal* 

But  the  difficulties  of  the  elastic  theory  do  not  end  with  the 
law  of  double  refraction,  although  they  are  there  more  conspicuous 
on  account  of  the  definite  and  simple  law  by  which  they  can  be 
judged.  It  does  not  easily  appear  how  the  equation  of  energies 
can  be  made  to  give  anything  like  the  proper  law  of  the  dispersion 
of  colore.  Since  for  given  directions  of  the  wave-normal  and  dis- 
placement, or  in  an  isotropic  body,  B^  is  constant,  and  also  AD  and  &d, 
except  so  far  as  the  type  of  the  vibration  varies,  the  formula  requires 
that  the  square  of  the  index  of  refraction  (which  is  inversely  as  Vs) 
should  be  equal  to  a  constant  diminished  by  a  term  proportional  to 
the  square  of  the  period,  except  so  far  as  this  law  is  modified  by  a 
variation  of  the  type  of  vibration.  But  experiment  shows  nothing 
like  this  law.  Now  the  variation  in  the  type  of  vibration  is  some- 
times very  important, — it  plays  the  leading  r61e  in  the  phenomena 
of  selective  absorption  and  abnormal  dispersion, — but  this  is  certainly 
not  always  the  case.  It  seems  hardly  possible  to  suppose  that  the 
type  of  vibration  is  always  so  variable  as  entirely  to  mask  the  law 
which  is  indicated  by  the  formula  when  AD  and  bj>  (with  B^)  are 
regarded  as  constant  This  is  especially  evident  when  we  consider 
that  the  effect  on  the  wave-velocity  of  a  small  variation  in  the  type  of 
vibration  will  be  a  small  quantity  of  the  second  order.t 

The  phenomena  of  dispersion,  therefore,  corroborate  the  conclusion 
which  seemed  to  follow  inevitably  from  the  law  of  double  refraction 
alone. 


*  The  reader  may  perhaps  ask  how  the  above  reasoning  is  to  be  reoonoiled  with  the 
fact  that  the  law  of  doable  refraotion  has  been  so  often  deduced  from  the  elastic  theory. 
The  troublesome  terms  are  bj>  and  the  variable  part  of  AD,  which  express  the  direct 
action  of  the  ponderable  molecules  on  the  ether.  So  far  as  the  (quite  limited)  reading 
and  recollection  of  the  present  writer  extend,  those  who  have  sought  to  derive  the  law 
of  double  refraotion  from  the  theory  of  elastic  solids  have  generally  either  neglected  this 
direct  action— a  negleot  to  which  Professor  Stokes  calls  attention  more  than  once  in  his 
celebrated  "  Report  on  Double  Refraction"  (Brit.  Assoc,  1802,  pp.  264,  288)— or  taking 
account  of  this  action  they  have  made  shipwreck  upon  a  law  different  from  Fresnol's  and 
contradicted  by  experiment. 

tSee  pages  190,  101  of  this  volume,  or  Lord  Rayleigh's  Theory  of  Sound,  vol  i, 
p.  84. 
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A  COMPARISON  OF  THE  ELECTRIC  THEORY  OF  LIGHT 
AND  SIR  WILLIAM  THOMSON'S  THEORY  OF  A  QUASI- 
LABILE  ETHER. 

[American  Journal  of  Science,  ser.  3,  voL  xxxvii,  pp.  139-144,  February,  1889.] 

A  remarkable  paper  by  Sir  William  Thomson,  in  the  November 
number  of  the  Philosophical  Magazine,  has  opened  a  new  vista  in 
the  possibilities  of  the  theory  of  an  elastic  ether.  Since  the  general 
theory  of  elasticity  gives  three  waves  characterized  by  different 
directions  of  displacement  for  a  single  wave-plane,  while  the  pheno- 
mena of  optics  show  but  two,  the  first  point  in  accommodating  any 
theory  to  observation,  is  to  get  rid  (absolutely  or  sensibly)  of  the 
third  wave.  For  this  end,  it  has  been  common  to  make  the  ether 
incompressible,  or,  as  it  is  sometimes  expressed,  to  make  the  velocity 
of  the  third  wave  infinite.  The  velocity  of  the  wave  of  compression 
becomes  in  fact  infinite  as  the  compressibility  vanishes.  Of  course 
it  has  not  escaped  the  notice  of  physicists  that  we  may  also  get 
rid  of  the  third  wave  by  making  its  velocity  zero,  as  may  be  done 
by  giving  certain  values  to  the  constants  which  express  the  elastic 
properties  of  the  medium,  but  such  values  have  appeared  impossible, 
as  involving  an  unstable  state  of  the  medium.  The  condition  of 
incompressibility,  absolute  or  approximate,  has  therefore  appeared 
necessary*  This  question  of  instability  has  now,  however,  been 
subjected  to  a  more  searching  examination,  with  the  result  that 
the  instability  does  not  really  exist  "provided  we  either  suppose  the 
mectium  to  extend  all  through  boundless  space,  or  give  it  a  fixed 
containing  vessel  as  its  boundary.'9  This  renders  possible  a  very 
simple  theory  of  light,  which  has  been  shown  to  give  Fresnel's  laws 
for  the  intensities  of  reflected  and  refracted  light  and  for  double 
refraction,  so  far  as  concerns  the  phenomena  which  can  be  directly 
observed.  The  displacement  in  an  aeolotropic  medium  is  in  the  same 
plane  passing  through  the  wave-normal  as  was  supposed  by  Fresnel, 

*  It  was  under  this  impression  that  the  paper  entitled  "  A  Comparison  of  the  Elastic 
and  the  Eleotrio  Theories  of  Light  with  respect  to  the  Law  of  Double  Refraction  and  the 
Dispersion  of  Colors,"  [this  volume  pp.  223-231],  was  written.  The  conclusions  of 
that  paper,  except  so  far  as  respects  the  dispersion  of  colors,  will  not  apply  to  the 
new  theory. 
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but  its  position  in  that  plane  is  different,  being  perpendicular  to  the 
ray  instead  of  to  the  wave-normal* 

It  is  the  object  of  this  paper  to  compare  this  new  theory  with 
the  electric  theory  of  light  In  the  limiting  cases,  that  is,  when  we 
regard  the  velocity  of  the  missing  wave  in  the  elastic  theory  as  zero, 
and  in  the  electric  theory  as  infinite,  we  shall  find  a  remarkable 
correspondence  between  the  two  theories,  the  motions  of  monochromatic 
light  within  isotropic  or  aeolotropic  media  of  any  degree  of  trans- 
parency or  opacity,  and  at  the  boundary  between  two  such  media, 
being  represented  by  equations  absolutely  identical,  except  that  the 
symbols  which  denote  displacement  in  one  theory  denote  force  in 
the  other,  and  vice  vers&A  In  order  to  exhibit  this  correspondence 
completely  and  clearly,  it  is  necessary  that  the  fundamental  principles 
of  the  two  theories  should  be  treated  with  the  same  generality,  and, 
so  far  as  possible,  by  the  same  method.  The  immediate  consequences 
of  the  new  theory  will  therefore  be  deduced  with  the  same  generality 
and  essentially  by  the  same  method  which  has  been  used  with 
reference  to  the  electric  theory  in  a  former  volume  of  this  Journal 
[page  211  of  this  volume]. 

The  elastic  properties  of  the  ether,  according  to  the  new  theory, 
vn  Us  limiting  case,  may  be  very  simply  expressed  by  means  of  a 
vector  operator,  for  which  we  shall  use  Maxwell's  designation.  The 
curl  of  a  vector  is  defined  to  be  another  vector  so  derived  from  the 
first  that  if  u,  v,  w  be  the  rectangular  components  of  the  first,  and 
u',  t/,  vf,  those  of  its  curl, 

~~Hy     dz'        ~~3s     <E>*        ~~dx~dy'  ^  ' 

where  x,  y,  z  are  rectangular  coordinates.  With  this  understanding, 
if  the  displacement  of  the  ether  is  represented  by  the  vector  6,  the 
force  exerted  upon  any  element  by  the  surrounding  ether  will  be 

—  B  curl  curl  <g  dxdy  dz,  (2) 

where  B  is  a  scalar  (the  so-called  rigidity  of  the  ether)  having  the 
same  constant  value  throughout  all  space,  whether  ponderable  matter 
is  present  or  not 

Where  there  is  no  ponderable  matter,  this  force  must  be  equated  to 
the  reaction  of  the  inertia  of  the  ether.  This  gives,  with  omission  of 
the  common  factor  dx  dy  dz, 

A<5  =  -  B  curl  curl  <&,  (8) 

where  A  denotes  the  density  of  the  ether. 

*  Sir  William  Thomson,  he.  citat.    R.  T.  Glazebrook,  Phil.  Mag.,  December,  1888. 

t  In  giving  us  a  new  interpretation  of  the  equations  of  the  electrio  theory,  the  author 
of  the  new  theory  has  in  fact  enriched  the  mathematical  theory  of  physios  with  some- 
thing whioh  may  be  compared  to  the  oelebrtted  principle  qf  duality  in  geometry. 
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The  presence  of  ponderable  matter  disturbs  the  motions  of  the 
ether,  and  renders  them  too  complicated  for  us  to  follow  in  detail. 
Nor  is  this  necessary,  for  the  quantities  which  occur  in  the  equations 
of  optics  represent  average  values,  taken  over  spaces  large  enough  to 
smooth  out  the  irregularities  due  to  the  ponderable  particles,  although 
very  small  as  measured  by  a  wave-length.*  Now  the  general  prin- 
ciples of  harmonic  motion  t  show  that  to  maintain  in  any  element  of 
volume  the  motion  represented  by 

iwiL 
«=««    p,  (4) 

8  being  a  complex  vector  constant,  will  require  a  force  from  outside 
represented  by  a  complex  linear  vector  function  of  6,  that  is,  the 
three  components  of  the  force  will  be  complex  linear  functions  of  the 
three  components  of  6.    We  shall  represent  this  force  by 

BV&dxdydz,  (5) 

where  ^  represents  a  complex  linear  vector  function.} 

If  we  now  equate  the  force  required  to  maintain  the  motion  in  any 
element  to  that  exerted  upon  the  element  by  the  surrounding  ether, 
we  have  the  equation       ^ ^^^  (6) 

which  expresses  the  general  law  for  the  motion  of  monochromatic 
light  within  any  sensibly  homogeneous  medium,  and  may  be  regarded 
as  implicitly  including  the  conditions  relating  to  the  boundary  of  two 
such  media,  which  are  necessary  for  determining  the  intensities  of 
reflected  and  refracted  light. 

For  let  ut  v,  w    be  the  components  of    <£, 

u't  if,v/        „  „  curl  <E, 

u",  v", «/'        „  „  curl  curl  (E, 

so  that  ,    dw^dv         >_^    §J^        /_*??    ^ 

^3^     3s*         "" dz~ dx'         ~dx~Jy' 

"    ^L    d}L      .,_&u!  _dnxf      „_dAf  jUm! % 
dy~dz'       ~~3z     ~3&f        ~~Ux     Hyy 
and  let  the  interface  be  perpendicular  to  the  axis  of  Z.    It  is  evident 

*  This  is  in  no  respect  different  from  what  is  always  tacitly  understood  in  the  theory 
of  sound,  where  the  displacements,  velocities,  densities  considered  are  always  snoh 
average  values.  But  in  the  theory  of  light,  it  is  desirable  to  have  the  fact  dearly  in 
mind  on  account  of  the  two  interpenetrating  media  (imponderable  and  ponderable),  the 
laws  of  light  not  being  in  all  respects,  the  same  as  they  would  be  for  a  single  homo- 
geneous medium. 

+  8ee  Lord  Rayleigh's  Theory  of  Sound,  vol.  i,  chapters  iv,  v. 

X  It  amounts  essentially  to  the  same  thing,  whether  we  regard  the  force  as  a  linear 

vector  function  of  (S  or  of  <£,  sinoe  these  differ  only  by  the  constant  factor  -—^ .     But 

there  are  some  advantages  in  expressing  the  force  as  a  function  of  <$,  because  the 
greater  part  of  the  force,  in  the  most  important  oases,  is  required  to  overcome  the  inertia 
of  the  ether,  and  is  thus  more  immediately  connected  with  ($. 
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that  if  v!  or  xf  is  discontinuous  at  the  interface,  the  value  of  u"  or  v" 
becomes  in  a  sense  infinite,  Le.,  curl  curl  €>  and  therefore  by  (6) 
¥6,  will  be  infinite.  Now  both  <g  and  ¥  are  discontinuous  at  the 
interface,  but  infinite  values  for  ^ffi  are  not  admissible.  Therefore 
u'  and  v'  are  continuous.  Again,  if  u  or  v  is  discontinuous,  v!  or  t/ 
will  become  infinite,  and  therefore  vf  or  tA  Therefore  u  and  v  are 
continuoua  These  conditions  may  be  expressed. in  the  most  general 
manner  by  saying  that  the  components  of  6  and  curl  $  parallel  to  the 
interface  are  continuous.  This  gives  four  complex  scalar  conditions, 
or  in  all  eight  scalar  conditions,  for  the  motion  at  the  interface,  which 
are  sufficient  to  determine  the  amplitude  and  phase  of  the  two 
reflected  and  the  two  refracted  rays  in  the  most  general  case.  It  is 
easy,  however,  to  deduce  from  these  four  complex  conditions,  two 
others,  which  are  interesting  and  sometimes  convenient.  It  is  evident 
from  the  definitions  of  vf  and  vf  that  if  u,  v,  vf,  and  xf  are  continuous 
at  the  interface  w  and  vf  will  also  be  continuous.  Now  —w"  is  equal 
to  the  component  of  ¥(£  normal  to  the  interface.  The  following 
quantities  are  therefore  continuous  at  the  interface : 

the  components  parallel  to  the  interface  of  (5, 
the  component  norma]  to  the  interface  of     ¥0,  (7) 

all  components  of  curl  d.t 

To  compare  these  results  with  those  derived  from  the  electrical 
theory,  we  may  take  the  general  equation  of  monochromatic  light  on 
the  electrical  hypothesis  from  a  paper  in  a  former  volume  of  this 
Journal.  This  equation,  which  with  an  unessential  difference  of  nota- 
tion may  be  written  * 

-Pot8-VQ=4T*&  (8) 

was  established  by  a  method  and  considerations  similar  to  those  which 
have  been  used  to  establish  equation  (6),  except  that  the  ordinary  law 
of  electrodynamic  induction  had  the  place  of  the  new  law  of  elasticity, 
gf  is  a  complex  vector  representing  the  electrical  displacement  as  a 
harmonic  function  of  the  time ;  #  is  a  complex  linear  vector  operator, 
such  that  4ir$3f  represents  the  electromotive  force  necessary  to  keep 
up  the  vibration  gf-    Q  is  a  complex  scalar  representing  the  electro- 
static potential,  VQ  the  vector  of  which  the  three  components  are 
dQ     dQ     dQ 
3jd'    dy'    dz' 
Pot  denotes  the  operation  by  which  in  the  theory  of  gravitation  the 
potential  is  calculated  from  the  density  of  matter.t     When  it  is 

*  See  page  218  of  this  volume,  equation  (12). 

t  The  symbol  -  Pot  is  therefore  equivalent  to  4tV-*,  as  used  by  Sir  William  Thomson 
(with  a  happy  economy  of  symbols)  at  the  last  meeting  of  British  Association  to  express 
the  same  law  of  electrodynamic  induotion,  exoept  that  the  symbol  is  here  used  as  a 
vector  operator.     See  NcUurt,  vol.  xxxviii,  p.  671,  sub  init. 


Digitized  by  VjOOQIC 


236       COMPARISON  OF  THE  ELECTRIC  THEORY  OF  LIGHT 

applied  as  here  to  a  vector,  the  three  components  of  the  result  are  to 
be  calculated  separately  from  the  three  components  of  the  operand. 
—  VQ  is  therefore  the  electrostatic  force,  and  —Pot  gf  the  electro- 
dynamic  force.  In  establishing  the  equation,  it  was  not  assumed  that 
the  electrical  motions  are  adenoidal,  or  such  as  to  satisfy  the  so-called 
"  equation  of  continuity."  We  may  now,  however,  make  this  assump- 
tion, since  it  is  the  extreme  case  of  the  electric  theory  which  we  are 
to  compare  with  the  extreme  case  of  the  elastic. 

It  results  from  the  definitions  of  curl  and  V  that  curl  VQ=0.  We 
may  therefore  eliminate  Q  from  equation  (8)  by  taking  the  curL 
This  gives 

-curl  Pot  $=4ir  curl  #$.  (9) 

Since  curl  curl  and  j—  Pot  are  inverse  operators  for  solenoidal  vectors, 

we  may  get  rid  of  the  symbol  Pot  by  taking  the  curl  again.  We 
thus  get 

-  ft = curl  curl  $$.  (10) 

The  conditions  for  the  motion  at  the  boundary  between  different 
media  are  easily  obtained  from  the  following  considerations.  Pot  Qr 
and  Q  are  evidently  continuous  at  the  interface.  Therefore  the 
components  parallel  to  the  interface  of  VQ,  and  by  (8)  of  $ft,  will  be 
continuous.  Again,  curl  Pot  jj  is  continuous  at  the  interface,  as 
appears  from  the  consideration  that  curl  Pot  j  is  the  magnetic  force 
due  to  the  electrical  motions  ft.  Therefore,  by  (9),  curl  $ft  is  con- 
tinuous. The  solenoidal  condition  requires  that  the  component  of  ft 
normal  to  the  interface  shall  be  continuous. 

The  following  quantities  are  therefore  continuous  at  the  interface : 

the  components  parallel  to  the  interface  of  #ft,        \ 

the  component  normal  to  the  interface  of    ft,  I        (11) 

all  components  of  curl  $ft.J 

Of  these  conditions,  the  two  relating  to  the  normal  components  of  ft 
and  curl  $ft  are  easily  shown  to  result  from  the  other  four  conditions, 
as  in  the  analogous  case  in  the  elastic  theory. 

If  we  now  compare  in  tne  two  theories  the  differential  equations  of 
the  motion  of  monochromatic  light  for  the  interior  of  a  sensibly 
homogeneous  medium,  (6)  and  (10),  and  the  special  conditions  for  the 
boundary  between  two  such  media  as  represented  by  the  continuity  of 
the  quantities  (7)  and  (11),  we  find  that  these  equations  and  conditions 
become  identical,  if 

ft=¥«,  (12) 

«=#ft,  (13) 

*=#-1.  (14) 
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In  other  words,  the  displacements  in  either  theory  are  subject  to  the 
same  general  and  surface  conditions  as  the  forces  required  to  maintain 
the  vibrations  in  an  element  of  volume  in  the  other  theory. 

To  fix  our  ideas  in  regard  to  the  signification  of  ¥  and  $,  we  may 
consider  the  case  of  isotropic  media,  in  which  these  operators  reduce 
to  ordinary  algebraic  quantities,  simple  or  complex.  Now  the  curl 
of  any  vector  necessarily  satisfies  the  solenoidal  condition  (the 
so-called  "equation  of  continuity"),  therefore  by  (6)  ¥ffi  and  d  will 
be  solenoidal.  So  also  will  g  and  $gf  in  the  electrical  theory.  Now 
for  solenoidal  vectors 

-cnrlcnrl-^+^+g,  (15) 

so  that  the  equations  (6)  and  (10)  reduce  to 

«•-(£+$+£>«•  <"> 

•-(£+£+£)■»  (»> 

For  a  simple  train  of  waves,  the  displacement,  in  either  theory,  may 
be  represented  by  a  constant  multiplied  by 

^+«*+6y+«)  (18) 

Our  equations  then  reduce  again  to 

^e=(a*+6*+c*)ffi,  (19) 

£»8f=(a*+&*+c*)#3f.  (20) 

Hence 

The  last  member  of  this  equation,  when  real,  evidently  expresses  the 
square  of  the  velocity  of  light    If  we  set 

*-»*±|±*.  (22) 

k  denoting  the  velocity  of  light  in  vacuo,  we  have 

n'-JP^-tf*-1.  (28) 

When  n*  is  positive,  which  is  the  case  of  perfectly  transparent 
bodies,  the  positive  root  of  n*  is  called  the  index  of  refraction  of  the 
medium.  In  the  most  general  case,  it  would  be  appropriate  to  call 
7*— or  perhaps  that  root  of  n*  of  which  the  real  part  is  positive— the 
(complex)  index  of  refraction,  although  the  terminology  is  hardly 
settled  in  this  respect  A  negative  value  of  n%  would  represent  a 
body  from  which  light  would  be  totally  reflected  at  all  angles  of 
incidence.  No  such  cases  have  been  observed.  Values  of  n*  in  which 
the  coefficient  of  i  is  negative,  indicate  media  in  which  light  is 
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absorbed.  Values  in  which  the  coefficient  of  *  is  positive  would 
represent  media  in  which  the  opposite  phenomenon  took  place.* 

It  is  no  part  of  the  object  of  this  paper  to  go  into  the  details 
by  which  we  may  derive,  so  far  as  observable  phenomena  are  con- 
cerned, Fresnel's  law  of  double  refraction  for  transparent  bodies, 
as  well  as  the  more  general  law  of  the  same  character  which  relates  to 
aeolotropic  bodies  of  more  or  less  opacity,  and  which  differs  from 
Fresnel's  only  in  that  certain  quantities  become  complex,  or  Freenels 
laws  for  the  intensities  of  reflected  and  refracted  light  at  the  boun- 
dary of  transparent  isotropic  media,  with  the  more  general  laws  for 
the  case  of  bodies  aeolotropic  or  opaque  or  both.  The  principal  cases 
have  already  been  discussed  on  the  new  elastic  theory  in  the 
Philosophical  Magazine  t  and  a  further  discussion  is  promised.  For 
the  electrical  theory,  the  case  of  double  refraction  in  perfectly  trans- 
parent media  has  been  discussed  quite  in  detail  in  this  Journal,  X  and 
the  intensities  of  reflected  and  refracted  light  have  been  abundantly 
deduced  from  the  above  conditions  by  various  authors.  §  So  far  as  all 
these  laws  are  concerned,  the  object  of  this  paper  will  be  attained  if 
if  it  has  been  made  clear  that  the  two  theories,  in  their  extreme  cases, 
give  identical  results.  The  greater  or  less  degree  of  elegance,  or 
completeness,  or  perspicuity,  with  which  these  laws  may  be  developed 
by  different  authors,  should  weigh  nothing  in  favor  of  either  theory. 

The  non-magnetic  rotation  of  the  plane  of  polarization,  with  the 
allied  phenomena  in  aeolotropic  bodies,  lie  in  a  certain  sense  outside 
of  the  above  laws,  as  depending  on  minute  quantities  which  have  been 
neglected  in  this  discussion.  The  manner  in  which  these  minute 
quantities  affect  the  equations  of  motion  on  the  electrical  theory  has 
been  shown  in  a  former  paper,  ||  where  these  phenomena  in  trans- 
parent bodies  are  treated  quite  at  length.  For  the  new  theory,  a 
discussion  of  this  subject  is  promised  by  Mr.  Olazebrook. 

But  the  magnetic  rotation  of  the  plane  of  polarization,  with  the 
allied  phenomena  when  an  aeolotropic  body  is  subjected  to  magnetic 
influence,  fall  entirely  within  the  scope  of  the  above  equations  and 
surface-conditions.  The  characteristic  of  this  case  is  that  ¥  and  $ 
are  not  self-conjugate,  f    This  is  what  we  might  expect  on  the  electric 

♦But  i  might  have  been  introduced  into  the  equations  in  Buch  a  way  that  a  positive 
coefficient  in  the  value  of  n*  would  indicate  absorption,  and  a  negative  ooeffioient  the 
impossible  case. 

t  Sir  William  Thomson,  loc.  citat.    R.  T.  Olazebrook,  loc  citat. 

I  This  voL  p.  182. 

gLorentz,  Sohlftmiloh's  Zeit*chr\fl>  voL  xxii,  pp.  1-30  and  205-210;  vol.  xziii, 
pp.  107-210;  Fitzgerald,  Phil.  Trans.,  vol  olxzi,  p.  001;  J.  J.  Thomson,  Phil. 
Mag.  (5),  voL  ix,  p.  284 ;  Rayieigh,  Phil.  Mag.  (5),  vol.  xii,  p.  81.  Olazebrook,  Proc 
Oambr.  Phil.  Soc,  vol  iv,  p.  165. 

ilThisvoLp.  105. 

t  See  this  voL  p.  217. 


Digitized  by  VjOOQIC 


AND  THE  THEORY  OF  A  QUASI-LABILE  ETHER  239 

theory  from  the  experiments  of  Dr.  Hall,  which  show  that  the 
operators  expressing  the  relation  between  electromotive  force  and 
current  are  not  in  general  self-conjugate  in  this  case. 

In  the  preceding  comparison,  we  have  considered  only  the  limiting 
cases  of  the  two  theories.  With  respect  to  the  sense  in  which  the 
limiting  case  is  admissible,  the  two  theories  do  not  stand  on  quite 
the  same  footing.  In  the  electric  theory,  or  in  any  in  which  the 
velocity  of  the  missing  wave  is  very  great,  if  we  are  satisfied  that 
the  compressibility  is  so  small  as  to  produce  no  appreciable  results, 
we  may  set  it  equal  to  zero  in  our  mathematical  theoryr  even  if  we 
do  not  regard  this  as  expressing  the  actual  facts  with  absolute 
accuracy.  But  the  case  is  not  so  simple  with  an  elastic  theory  in 
which  the  forces  resisting  certain  kinds  of  motion  vanish,  so  far 
at  least  as  they  are  proportional  to  the  strains.  The  first  requisite 
for  any  sort  of  optical  theory  is  that  the  forces  shall  be  proportional 
to  the  displacements.  This  is  easily  obtained  in  general  by  supposing 
the  displacements  very  small.  But  if  the  resistance  to  one  kind  of 
distortion  vanishes,  there  will  be  a  tendency  for  this  kind  of  distortion 
to  appear  at  some  places  in  an  exaggerated  form,  and  even  to  an 
infinite  degree,  however  small  the  displacements  may  be  in  other 
parts  of  the  field.  In  the  case  before  us,  if  we  suppose  the  velocity 
of  the  missing  wave  to  be  absolutely  zero,  there  will  be  infinite 
condensations  and  rarefactions  at  a  surface  where  ordinary  waves 
are  reflected.  That  is,  a  certain  volume  of  ether  will  be  condensed 
to  a  surface,  and  vice  verad.  This  prevents  any  treatment  of  the 
extreme  case,  which  is  at  once  simple  and  satisfactory.  The  difficulty 
has  been  noticed  by  Sir  William  Thomson,  who  observes  that  it 
may  be  avoided  if  we  suppose  the  displacements  infinitely  small  in 
comparison  with  the  wave-length  of  the  wave  of  compression.  This 
implies  a  finite  velocity  for  that  wave.  A  similar  difficulty  would 
probably  be  found  to  exist  (in  the  extreme  case)  with  regard  to 
the  deformation  of  the  ether  by  the  molecules  of  ponderable  matter, 
as  the  ether  oscillates  among  them.  If  the  statical  resistance  to 
irrotational  motions  is  zero,  it  is  not  at  all  evident  that  the  statical 
forces  evoked  by  the  disturbance  caused  by  the  molecules  would  be 
proportional  to  the  motions.  But  this  difficulty  would  be  obviated  by 
the  same  hypothesis  as  the  first. 

These  circumstances  render  the  elastic  theory  somewhat  less  con- 
venient as  .a  working  hypothesis  than  the  electric  They  do  not 
necessarily  involve  any  complication  of  the  equations  of  optics.  For 
it  may  still  be  possible  that  this  velocity  of  the  missing  wave  is 
so  small  that  the  quantities  on  which  it  depends  may  be  set  equal 
to  zero  in  the  equations  which  represent  the  phenomena  of  optics. 
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But  the  mental  processes  by  which  we  satisfy  ourselves  of  the 
validity  of  our  results  (if  we  do  not  work  out  the  whole  problem 
in  the  general  case  of  no  assumption  in  regard  to  the  velocity  of 
the  missing  wave)  certainly  involve  conceptions  of  a  higher  degree 
of  difficulty  on  account  of  the  circumstances  mentioned.  Perhaps 
this  ought  not  to  affect  our  judgment  with  respect  to  the  question 
of  thejruthof  the  hypothesis. 

Although  the  two  theories  give  laws  of  exactly  the  same  form 
for  monochromatic  light  in  the  limiting  case,  their  deviations  from 
this  limit  are  in  opposite  directions,  so  that  if  the  phenomena  of 
optics  differed  in  any  marked  degree  from  what  we  would  have  in 
the  limiting  case,  it  would  be  easy  to  find  an  experimentvmi  cruris 
to  decide  between  the  two  theories.  A  little  consideration  will  make 
it  evident,  that  when  the  principal  indices  of  refraction  of  a  crystal  are 
given,  the  intermediate  values  for  oblique  wave-planes  will  be  less  if 
the  velocity  of  the  missing  wave  is  small  but  finite,  than  if  it  is  infini- 
tesimal, and  will  be  greater  if  the  velocity  of  the  missing  wave  is  very 
great  than  if  it  is  infinite.*  Hence,  if  the  velocity  of  the  missing, 
wave  is  small  but  finite,  the  intermediate  values  of  the  indices  of 
refraction  will  be  less  than  are  given  by  Fresnel's  law,  but  if  the 
velocity  of  the  missing  wave  is  very  great  but  finite,  the  intermediate 
values  of  the  indices  of  refraction  will  be  greater  than  are  given  by 
Fresnel's  law.  But  the  recent  experiments  of  Professor  Hastings  on 
the  law  of  double  refraction  in  Iceland  spar  do  not  encourage  us  to 
look  in  this  direction  for  the  decision  of  the  question,  t 

In  a  simple  train  of  waves  in  a  transparent  medium,  the  potential 
energy,  on  the  elastic  theory,  may  be  divided  into  two  parts,  of  which 
one  is  due  to  that  general  deformation  of  the  ether  which  is  repre- 
sented by  the  equations  of  wave-motion,  and  the  other  to  those 
deformations  which  are  caused  by  the  interference  of  the  ponderable 
particles  with  the  wave-motion,  and  to  such  displacements  of  the 
ponderable  matter  as  may  be  caused,  in  some  cases  at  least,  by  the 
motion  of  the  ether.  If  we  write  h  for  the  amplitude,  I  for  the  wave- 
length, and  p  for  the  period,  these  two  parts  of  the  Statical  energy 
(estimated  per  unit  of  volume  for  a  space  including  many  wave- 
lengths) may  be  represented  respectively  by 

TT*Bh*        ,  bh* 

*  This  may  be  more  clear  if  we  consider  the  stationary  waves  formed  by  two  trains  of 
waves  moving  in  opposite  directions.    The  case  then  comes  under  the  following  theorem : 

"  If  the  system  undergo  such  a  change  that  the  potential  energy  of  a  given  configur- 
ation is  diminished,  while  the  kinetic  energy  of  a  given  motion  is  unaltered,  the  periods 
of  the  free  vibrations  are  all  increased,  and  conversely. "  See  Lord  Rayleigh's  Theory  of 
Sound,  vol.  i,  p.  85. 

iAm.  Jour.  /Set.,  ser  3,  vol.  xxxv,  p.  60. 
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The  sum  of  these  may  be  equated  to  the  kinetic  energy,  giving  an 
equation  of  the  form 

-T^+T555-?-  (24) 

B  is  an  absolute  constant  (the  rigidity  of  the  ether,  previously 
represented  by  the  same  letter),  A'  and  b  will  be  constant  (for  the 
same  medium  and  the  same  direction  of  the  wave-normal)  except 
so  far  as  the  type  of  the  motion  changes,  Le.,  except  so  far  as  the 
manner  in  which  the  motion  of  the  ether  distributes  itself  between 
the  ponderable  molecules,  and  the  degree  in*  which  these  take  part 
in  the  motion,  may  undergo  a  change.  When  the  period  of  vibration 
varies,  the  type  of  motion  will  vary  more  or  less,  and  A'  and  6  will 
vary  more  or  less. 

In  a  manner  entirely  analogous,*  the  kinetic  energy,  on  the  elec- 
trical theory,  may  be  divided  into  two  parts,  of  which  one  is  due  to 
those  general  fluxes  which  are  represented  by  the  equations  of  wave- 
motions,  and  the  other  to  those  irregularities  in  the  fluxes  which 
are  caused  by  the  presence  of  the  ponderable  molecules,  as  well  as  to 
such  motions  of  the  ponderable  particles  themselves  as  may  some- 
times occur.  These  parts  of  the  kinetic  energy  may  be  represented 
respectively  by 

f      **        P*  ' 
Their  sum  equated  to  the  potential  energy  gives 

TrWM^fh*    GA*  /0Kv 

Here  F  is  the  constant  of  electrodynamic  induction,  which  is  unity  if 
we  use  the  electromagnetic  system  of  units,  /  and  Q  (like  A'  and  6) 
vary  only  so  far  as  the  type  of  motion  varies. 

We  have  the  means  of  forming  a  very  exact  numerical  estimate 
of  the  ratio  of  the  two  parts  into  which  the  statical  energy  is  thus 
divided  on  the  elastic  theory,  or  the  kinetic  energy  on  the  electric 
theory.  The  means  for  this  estimate  is  afforded  by  the  principle 
that  the  period  of  a  natural  vibration  is  stationary  when  its  type  is 
inflnitesimally  altered  by  any  constraint,  t  Let  us  consider  a  case  of 
simple  wave-motion,  and  suppose  the  period  to  be  inflnitesimally 
varied:  the  wave-length  will  also  vary,  and  presumably  to  some 
extent  the  type  of  vibration.  But,  by  the  principle  just  stated,  if  the 
ether  or  the  electricity  could  be  constrained  to  vibrate  in  the  original 
type,  the  variations  of  I  and  p  would  be  the  same  as  in  the  actual 

*  See  page  182  of  this  volume. 

t  See  Lord  Rayleigh's  Theory  o/Sotmd,  voL  i,  p.  84.  The  application  of  the  principle 
is  most  simple  in  the  ease  of  stationary  waves. 
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ease.  Therefore,  in  finding  the  differential  equation  between  I  and  p, 
we  may  treat  b  and  A'  in  (24)  and/and  G  in  (25)  as  constant,  as  well 
as  B  and  F.    These  equations  may  be  written 


4r*B|J+6p*=4ir*A', 


Differentiating,  we  get 

4r«Bd^=-6d(^, 

xF<*JJ=-T*/d(p-*); 


or 


4x«B^diog|;=-6^dlog^ 
*FJUlogJJ=-^dlogjr*. 


Hence,  if  we  write  V  for  the  wave- velocity  (l/p),  n  for  the  index  of 
refraction,  and  X  for  the  wave-length  in  vacuo,  we  have  for  the  ratio 
of  the  two  parts  into  which  we  have  divided  the  potential  energy  on 
the  elastic  theory, 

bh*    v*Bfc*    dlogV        dlogn  /9a. 

T*n^fedT^"dto£\'  (26) 

and  for  the  ratio  of  the  two  parts  into  which  we  have  divided  the 
kinetic  energy  on  the  electrical  theory, 

^^^rFTO^dlogV^dlo^  (27) 

jp  '     p*       dlogp        dlog\ 

It  is  interesting  to  see  that  these  ratios  have  the  same  value.  This 
value  may  be  expressed  in  another  form,  which  is  suggestive  of  some 
important  relations.  If  we  write  IT  for  what  Lord  Rayleigh  has 
called  the  velocity  of  a  group  of  waves* 

u_i     dlogV 
7~l~dlogr 

rflogV_V-U 
dlogl        V""* 

It  appears,  therefore,  that  in  the  elastic  theory  that  part  of  the 
potential    energy    which    depends    on    the    deformation    expressed 

•  See  his  "Note  on  Progressive  Waves,"  Proe.  Lend.  Math.  Soc,  voL  ix,  No.  185, 
reprinted  in  his  Theory  of  Sound,  vol  ii,  p.  297. 
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by  the  equations  of  wave-motion,  bears  to  the  whole  potential 
energy  the  same  ratio  which  the  velocity  of  a  group  of  waves 
bears  to  the  wave-velocity.  In  the  electrical  theory,  that  part  of 
the  kinetic  energy  which  depends  on  the  motions  expressed  by  the 
equations  of  wave-motion  bears  to  the  whole  kinetic  energy  the  same 
ratio. 

Returning  to  the  consideration  of  equations  (26)  and  (27),  we 
observe  that  in  transparent  bodies  the  last  member  of  these  equations 
represents  a  quantity  which  is  small  compared  with  unity,  at  least  in 
the  visible  spectrum,  and  diminishes  rapidly  as  the  wave-length 
increases.  This  is  just  what  we  should  expect  of  the  first  member  of 
equation  (27).  But  when  we  pass  to  equation  (26),  which  relates  to 
the  elastic  theory,  the  case  is  entirely  different  The  fact  that  the 
kinetic  energy  is  affected  by  the  presence  of  the  ponderable  matter 
and  affected  differently  in  different  directions,  shows  that  the  motion 
of  the  ether  is  considerably  modified  This  implies  a  distortion  super- 
posed upon  the  distortion  represented  by  the  equations  of  wave- 
motion,  and  very  much  greater,  since  the  body  is  very  fine-grained  as 
measured  by  a  wave-length.  With  any  other  law  of  elasticity,  we 
should  suppose  that  the  energy  of  this  superposed  distortion  would 
enormously  exceed  that  of  the  regular  distortion  represented  by  the 
equations  of  wave-motion.  But  it  is  the  peculiarity  of  this  new  law 
of  elasticity  that  there  is  one  kind  of  distortion,  of  which  the  energy 
is  very  small,  and  which  is  therefore  peculiarly  likely  to  occur.  Now 
if  we  can  suppose  the  distortion  caused  by  the  ponderable  molecules 
to  be  almost  entirely  of  this  kind,  we  may  be  able  to  account  for  the 
smallness  of  its  energy.  We  should  still  expect  the  first  member  of 
(26)  to  increase  with  the  wave-length,  on  account  of  the  factor  P, 
instead  of  diminishing,  as  the  last  member  of  the  equation  shows  that 
it  does.  We  are  obliged  to  suppose  that  6,  and  therefore  the  type  of 
the  vibrations,  varies  very  rapidly  with  the  wave-length,  even  in  those 
cases  which  appear  farthest  removed  from  anything  like  selective 
absorption. 

The  electrical  theory  furnishes  a  relation  between  the  refractive 
power  of  a  body  and  its  specific  dielectric  capacity,  which  is  com- 
monly expressed  by  saying  that  the  latter  is  equal  to  the  square  of 
the  index  of  refraction  for  waves  of  infinite  length.  No  objection  can 
be  made  to  this  statement,  but  the  great  uncertainty  in  determining 
the  inden  for  waves  of  infinite  length  by  extrapolation  prevents  it 
from  furnishing  any  very  rigorous  test  of  the  theory.  Yet,  as  the 
results  of  extrapolation  in  some  cases  agree  strikingly  with  the 
specific  dielectric  capacity,  although  in  other  cases  they  are  quite 
different,  the  correspondence  is*  generally  regarded  as  corroborative,  in 
some  degree,  of  the  theory.    But  the  relation  between  refractive  power 
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and  dielectric  capacity  may  be  expressed  in  a  form  which  will  furnish 
a  more  rigorous  test,  as  not  involving  extrapolation. 

We  have  seen  on  page  242  how  we  may  determine  numerically  the 
ratio  of  the  two  first  terms  of  equation  (25).  We  thus  easily  get 
the  ratio  of  the  first  and  last  term,  which  gives 

Qh*_d\ogl  tFW 
4  'dlogX^p^'  V*' 

In  the  corresponding  equation  for  a  train  of  waves  of  the  same  ampli- 
tude and  period  in  vacuo,  I  becomes  X,  F  remains  the  same,  and  for  G 
we  may  write  G'.    This  gives 

G'A*    xFXW 


4  p* 

Dividing,  we  get 


(30) 


Q_rflog/    V  __d(V)  ,3n 

G'"3To^X  X2"~cJ(X*y  l    ' 

Now  G'  is  the  dielectric  elasticity  of  pure  ether.  If  K  is  the 
specific  dielectric  capacity  of  the  body  which  we  are  considering, 
G'/K  is  the  dielectric  elasticity  of  the  body  and  G'/2K  is  the  potential 
energy  of  the  body  (per  unit  of  volume),  due  to  a  unit  of  ordinary 
electrostatic  displacement  But  Qh*/4i  is  the  potential  energy  in  a 
train  of  waves  of  amplitude  h.  Since  the  average  square  of  the  dis- 
placement is  A.*/2,  the  potential  energy  of  a  unit  displacement  such  as 
occurs  in  a  train  of  waves  is  G/2.  Now  in  the  electrostatic  experi- 
ment the  displacement  distributes  itself  among  the  molecules  so  as  to 
make  the  energy  a  minimum.  But  in  the  case  of  light  the  distribu- 
tion of  the  displacement  is  not  determined  entirely  by  statical 
considerations.    Hence 

and 

K  >  ^l  (33) 

*=d(P) 

It  is  to  be  observed  that  if  we  should  assume  for  a  dispersion-formula 

n-*=a-6X-2,  (34) 

I/a,  which  is  the  square  of  the  index  of  refraction  for  an  infinite 
wave-length,  would  be  identical  with  the  second  member  of  (33). 

Another  similarity  between  the  electrical  and  optical  properties  of 
bodies  consists  in  the  relation  between  conductivity  and  opacity. 
Bodies  in  which  electrical  fluxes  are  attended  with  absorption  of 
energy  absorb  likewise  the  energy  of  the  motions  which  constitute 
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light.  This  is  strikingly  true  of  the  metals.  But  the  analogy  does 
not  stop  here.  To  fix  our  ideas,  let  us  consider  the  case  of  an 
isotropic  body  and  circularly  polarized  light,  which  is  geometrically 
the  simplest  case  although  its  analytical  expression  is  not  so  simple 
as  that  of  plane-polarized  light.  The  displacement  at  any  point 
may  be  symbolized  by  the  rotation  of  a  point  m  a  circle.  The 
external  force  necessary  to  maintain  the  displacement  3f  is  represented 
by  n~*$.  In  transparent  bodies,  for  which  n~*  is  a  positive  number, 
the  force  is  radial  and  in  the  direction  of  the  displacement,  being 
principally  employed  in  counterbalancing  the  dielectric  elasticity, 
which  tends  to  diminish  the  displacement  In  a  conductor  n~* 
becomes  complex,  which  indicates  a  component  of  the  force  in  the 
direction  of  %  that  is,  tangential  to  the  circle.  This  is  only  the 
analytical  expression  of  the  fact  above  mentioned.  But  there  is 
another  optical  peculiarity  of  metals,  which  has  c&used  much  remark, 
viz.,  that  the  real  part  of  n%  (and  therefore  of  n'*)  is  negative,  i.e., 
the  radial  component  of  the  force  is  directed  towards  the  center. 
This  inwardly  directed  force,  which  evidently  opposes  the  electro- 
dynamic  induction  of  the  irregular  part  of  the  motion,  is  small 
compared  with  the  outward  force  which  is  found  in  transparent 
bodies,  but  increases  rapidly  as  the  period  diminishes.  We  may  say, 
therefore,  that  metals  exhibit  a  second  optical  peculiarity, — that  the 
dielectric  elasticity  is  not  prominent  as  in  transparent  bodies.  This 
is  like  the  electrical  behavior  of  the  metals,  in  which  we  do  not 
observe  any  elastic  resistance  to  the  motion  of  electricity.  We  see, 
therefore,  that  the  complex  indices  of  metals,  both  in  the  real  and 
the  imaginary  part  of  their  inverse  squares,  exhibit  properties 
corresponding  to  the  electrical  behavior  of  the  metals. 

The  case  is  quite  different  in  the  elastic  theory.  Here  the  force 
from  outside  necessary  to  maintain  in  any  element  of  volume  the 
displacement  <£  is  represented  by  n*&.  In  transparent  bodies, 
therefore,  it  is  directed  toward  the  center.  In  metals,  there  is  a 
component  in  the  direction  of  the  motion  <£,  while  the  radial  part  of 
the  force  changes  its  direction  and  is  often  many  times  greater  than 
the  opposite  force  in  transparent  bodies.  This  indicates  that  in 
metals  the  displacement  of  the  ether  is  resisted  by  a  strong  elastic 
force,  quite  enormous  compared  to  anything  of  the  kind  in  trans* 
parent  bodies,  where  it  indeed  exists,  but  is  so  small  that  it  has  been 
neglected  by  most  writers  except  when  treating  of  dispersion.  We 
can  make  these  suppositions,  but  they  do  not  correspond  to 
anything  which  we  know  independently  of  optical  experiment. 

It  is  evident  that  the  electrical  theory  of  light  has  a  serious  rival, 
in  a  sense  in  which,  perhaps,  one  did  not  exist  before  the  publication 
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of  Sir  William  Thomson's  paper  in  November  last*  Nevertheless, 
neither  surprise  at  the  results  which  have  been  achieved,  nor 
admiration  for  that  happy  audacity  of  genius,  which,  seeking  the 
solution  of  the  problem  precisely  where  no  one  else  would  have 
ventured  to  look  for  it,  has  turned  half  a  century  of  defeat  into 
victory,  should  blind  us  to  the  actual  state  of  the  question. 

It  may  still  be  said  for  the  electrical  theory,  that  it  is  not  obliged 
to  invent  hypotheses, t  but  only  to  apply  the  laws  furnished  by 
the  science  of  electricity,  and  that  it  is  difficult  to  account  for  the 
coincidences  between  the  electrical  and  optical  properties  of  media, 
unless  we  regard  the  motions  of  light  as  electrical.  But  if  the 
electrical  character  of  light  is  conceded,  the  optical  problem  is  very 
different  from  anything  which  existed  in  the  time  of  Fresnel,  Cauchy, 
and  Green.  The  third  wave,  for  example,  is  no  longer  something 
to  be  gotten  rid  of  quocunque  modo>  but  something  which  we  must 
dispose  of  in  accordance  with  the  laws  of  electricity.  This  would 
seem  to  rale  out  the  possibility  of  a  relatively  small  velocity  for 
the  third  wave. 


*  "  Sinoe  the  first  publication  of  Oauohy's  work  on  the  subject  in  1880,  and  of  Green's 
in  1887,  many  attempts  have  been  made  by  many  workers  to  find  a  dynamical 
foundation  by  Fresnel's  laws  of  reflexion  and  refraction  of  light,  but  all  hitherto 
ineffectually."    Sir  William  Thomson,  foe.  cUat. 

"  So  far  as  I  am  aware,  the  eleotrio  theory  of  Maxwell  is  the  only  one  satisfying  these 
conditions  (of  explaining  at  onoe  FresneTs  laws  of  double  refraction  in  crystals  and 
those  governing  the  intensity  of  reflexion  when  light  passes  from  one  isotropic  medium 
to  another)."    Lord  Rayleigh,  PhU.  Mag.,  September,  1888. 

t  Eleotrical  motions  in  air,  sinoe  the  recent  experiments  of  Professor  Herts,  seem  to 
be  no  longer  a  matter  of  hypothesis.  We  can  hardly  suppose  that  the  case  is  essentially 
different  with  the  so-called  vacuum.  The  theorem  that  the  eleotrical  motions  of  light 
are  solenoids!,  although  it  is  convenient  to  assume  it  as  a  hypothesis  and  show  that  the 
results  agree  with  experiment,  need  not  occupy  any  such  fundamental  position  in  the 
theory.  It  is  in  fact  only  another  way  of  saying  that  two  of  the  constants  of  eleotrioal 
scienoe  have  a  certain  ratio  (infinity).  It  would  be  easy  to  commence  without  — nmfag 
this  value,  and  to  show  in  the  course  of  the  development  of  the  subject  that  experiment 
requires  it,  not  of  course  as  an  abstract  proposition,  but  in  the  sense  in  which 
experiment  oan  be  said  to  require  any  values  of  any  constants,  that  is,  to  a  certain 
degree  of  approximation. 
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XVI. 

REVIEWS  OF  NEWCOMB  AND  MICHELSON'S  -VELOCITY 
OF  LIGHT  IN  AIR  AND  REFRACTING  MEDIA"  AND 
OF  KETTELER'S  "  THEORETISOHE  OPTIK" 

[American  Journal  of  Science,  ser.  3,  vol.  xxxl  pp.  62-67,  Jan.  1886.] 

Velocity  of  Light  in  Air  and  Refracting  Media. 

Astrvnonomical  Papers  prepared  for  the  use  of  the  American  Bphemeris  and 
Nautical  Almanac,  vol.  n.  parts  3  and  4,*  Washington,  1885. 

Professor  Nbwcomb  obtains  as  the  final  result  of  his  experiments 
at  Washington  299,860  ±30  kilometers  per  second  for  the  velocity  of 
light  in  vacuo.  Professor  MICHELSON'S  entirely  independent  experi- 
ments at  Cleveland  give  substantially  the  same  result  (299,853  ±60) 
His  former  experiments  at  the  Naval  Academy,  after  correction  of 
two  small  errors  which  he  now  reports,  give  299,910  ±50.  All  these 
experiments  were  made  with  the  revolving  mirror,  but  the  arrange- 
ments of  the  two  experimenters  were  in  other  respects  radically 
different  The  first  of  these  values  of  the  velocity  of  light  with 
Nyr&i's  value  of  the  constant  of  aberration  (20**492)  gives  149*60  for 
the  distance  of  the  sun  in  millions  of  kilometers.  On  acount  of  the 
recent  announcement  by  Messrs.  Young  and  Forbes  of  a  difference 
of  about  two  per  cent,  in  the  velocities  of  red  and  blue  light, 
especial  attention  was  paid  to  this  point  by  both  experimenters, 
without  finding  the  least  indication  of  any  difference.  In  Professor 
NewcomVs  experiments,  a  difference  of  only  one  thousandth  in  these 
velocities  would  have  produced  a  well-marked  iridescence  on  the 
edges  of  the  return  image  of  the  slit  formed  by  reflection  from  the 
revolving  mirror.  No  trace  of  such  iridescence  could  ever  be  seen. 
Professor  Michelson  made  an  experiment,  in  which  a  red  glass  covered 
one-half  the  slit.  The  two  halves  of  the  image — the  upper  white, 
the  lower  red — were  exactly  in  line. 

Since  Maxwell's  electromagnetic  theory  of  light  makes  the  velocity 
of  light   in  air  equal   to  the  ratio   of   the    electromagnetic    and 

•[Part  8,  "Measure*  of  the  Velocity  of  Light/'  8.  Newoomb;  part  4,  "Supple- 
mentary Measures  of  the  Velocities  of  white  and  colored  light  in  air,  water, 
and  carbon  disulphide,"  A.  A.  Michelson.] 
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electrostatic  units  of  electricity,  it  will  be  interesting  to  compare 
some  recent  determinations  of  this  ratio.  These  we  give  in  the 
following  table.  Since  the  determinations  are  affected  by  any  error 
in  the  standard  of  resistance,  we  have  corrected  the  results,  first,  on 
the  supposition  that  the  B.A.  ohm  =  '987  true  ohms  (Lord  Rayleigh's 
result),  and  secondly,  on  the  supposition  that  the  B.A.  ohm  =989  true 
ohms,  which  is  essentially  assuming  that  the  legal  ohm  represents  the 
true  value. 

Ratio  of  Electromagnetic  and  BUectrostatic  unite  of  Electricity  m  millions  of 
meters  and  seconds. 

Date.  AtpubUalMd.      B.A.  ohm-*M7.     B.  A.  ohm-  -9». 

Ayrton  &  Perry,#  1878  288-0  2961  2964 

Hookin,t  1879  298*8  296*9  297*2 

ShidA,:  1880  299*5  295*6  296*2 

ExnerJ  1882  801  *1(!)  291  *7(?)  292*3  (?) 

J.  J.  Thomson,  0  1883  296*3  296*3  296*9 

Klemen»i8,f  1884  301*88(?)  301*88(?)  302*48  (?) 

These  numbers  are  to  be  compared  with  the  velocity  of  light  in  air,  in 
millions  of  meters  per  second,  for  which  Professor  Newcomb  gives 
299*778.  Of  the  electrical  determinations,  that  of  J.  J.  Thomson 
appears  by  far  the  most  worthy  of  confidence.  That  of  Elemenci6 — 
the  only  one  as  great  as  the  velocity  of  light — was  obtained  by  the 
use  of  a  condenser  with  glass, — a  method  which  would  presumably 
give  too  great  a  ratio.  Exner's  value  is  obtained  from  the  mean  of 
three  determinations,  one  of  which  differed  from  the  others  by  about 
three  per  cent  If  we  reject  this  discordant  determination,  the  mean 
of  the  other  two  would  give  when  corrected  for  resistance  294*4  and 
295*0.  If  we  set  aside  the  determinations  of  Exner  and  KlemenclS, 
the  remaining  four,  which  represent  three  different  methods,  are  very 
accordant,  the  mean  being  nearly  identical  with  the  result  of  J.  J. 
Thomson,  and  about  one  per  cent,  less  than  the  velocity  of  light 

Professor  Michelson's  experiments  on  the  velocity  of  light  in  carbon 
disulphide  afford  an  interesting  illustration  of  the  difference  between 
the  velocity  of  waves  and  the  velocity  of  groups  of  waves — a  subject 
which  is  treated  at  length  in  an  appendix  to  the  second  volume  of 
Lord  Rayleigh's  Theory  of  Sovmd.  If  we  write  V  for  the  velocity 
of  waves,  U  for  that  of  a  group  of  waves,  L  for  the  wave-length,  and 
T  for  the  period  of  vibration, 

V-L  u-^T-1) 

For  purposes  of  numerical  calculation,  it  will  be  convenient  to 
transform  these  formulae  by  the  use  of  X  for  the  wave-length  in 

*  Phil.  Mag.,  (5),  voL  vii,  p.  277.      t  Report  Brit.  Assoc,  1879,  p.  285. 

tPhil.  Mag.,  (5),  vol  x,  p.  431.         f  SitzunysberichU  Wien,  Ahad.,  vol  lxxxvi,  p.  106. 

\\Phil.  Trans.,  voL  olxxiv,  p.  707.     *¥ Sitxungeberichte  Wien.  AkacL,  voL  lxxxix,  p.  298. 
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vacuo,  n  for  the  index  of  refraction  of  the  medium  considered,  and  k 
for  the  velocity  of  light  in  vacuo,  which  we  shall  regard  as  constant, 
in  accordance  with  general  usage.  By  substitution  of  these  letters  we 
easily  obtain 

k_  kdjnX'1) 

The  data  for  the  calculation  of  these  quantities  for  carbon  disulphide 
are  given  by  Verdet  (Annates  de  Chvmie,  et  de  Phypiqtu,  (3),  vol  lxix, 
p.  470).    They  give 

for  the  line  D,  'i/V = 1624,  ifc/U  « 1-722, 
for  the  line  E,ifc/V= 1637,  k/U =1-767. 

The  quotient  of  the  velocity  in  vacuo  divided  by  the  velocity  in 
carbon  disulphide,  according  to  Professor  Michelson's  experiments 
with  the  light  of  an  arc  lamp,  is  1*76  ±"02,  which  agrees  very  well 
with-  k/JJ.  Another  theory,  which  would  make  the  velocity  observed 
in  such  experiments  V*/U  (Nature,  vol.  xxv,  p.  52),  receives  no 
countenance  from  these  experiments.  The  value  of  HJ/V*  would 
be  about  1*53.  Some  may  think  that  the  experiments  on  water  point 
in  a  different  direction.  Taking  our  data  from  Beer  8  Einleitung  in 
die  hohere  Optik,  1863,  p.  411,  we  get 

forD,  Jfc/V=l-334,  &/TJ=l-352,  JbU/V«l-816, 
forE,  i/V- 1-336,  fc/U->  1-859,  fcU/V*=  1-313. 

The  number  obtained  by  experiment  was  1,330,  which  agrees  better 
with  k/V,  or  even  with  kU/V*,  than  with  k/\J,  but  the  differences  are 
here  too  small  to  have  much  significance. 


Theoretische    Optik,    gegrundet    auf   das    Besael-Sellmeier'scne 
Princip,  xugleich  mit  den  experimentellen  Belegen. 

Yon  Dr.  E.  Ebttbler,  Professor  an  der  Universitat  in  Bonn.    Viewig 
und  Sohn.    Braunschweig,  1885. 

The  principle  of  Sellmeier,  here  referred  to,  relates  to  vibrations  of 
ponderable  particles  excited  by  the  etherial  vibrations  of  light,  and  to 
the  reaction  of  the  former  upon  the  latter.  The  name  of  Bessel  is 
added  on  account  of  his  previous  solution  of  a  somewhat  analogous 
problem  relating  to  the  pendulum.  The  object  of  this  work  is  "to 
treat  theoretical  optics  in  a  complete  and  uniform  manner  on  the  new 
foundation  of  the  simultaneous  vibration  of  etherial  and  ponderable 
particles,  and  to  substitute  a  consistent  and  systematic  new  structure 
for  the  present  conglomerate  of  more  or  less  disconnected  principles." 
Such  a  work  demands  a  critical  examination,  which  should  not  be 
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undertaken  from  any  narrow  point  of  view.  Any  faults  of  detail  will 
be  readily  forgiven,  if  the  author  shall  give  the  theory  of  optics 
the  irov  crrw  which  it  has  sought  so  long  in  vain.  We  may  add  that 
if  this  effort  shall  not  be  judged  successful  by  the  scientific  world, 
the  author  will  at  least  have  the  satisfaction  of  being  associated  in  his 
failure  with  many  of  the  most  distinguished  names  in  mathematical 
physics. 

We  have  sought  to  test  the  proposed  theory  with  respect  to  that 
law  of  optics  which  seems  most  conspicuous  in  its  definite  mathe- 
matical form,  and  in  the  rigor  of  the  experimental  verifications  to 
which  it  has  been  subjected,  as  well  as  in  the  magnificent  develop- 
ments to  which  it  has  given  rise :  the  law  of  double  refraction  due  to 
Huyghens  and  Fresnel,  and  geometrically  illustrated  by  the  wave- 
surface  of  the  latter.  We  cannot  find  that  the  law  of  Fresnel  is 
proved  at  all  in  this  treatise.  We  find  on  the  contrary,  that  a  law  is 
deduced  which  is  different  from  Fresnel's,  and  inconsistent  with  it. 
We  do  not  refer  to  anything  relating  to  the  direction  of  vibration  of 
the  rays  in  a  crystal,  which  is  a  point  not  touched  by  the  experi- 
mental verifications  to  which  we  have  alluded  We  shall  confine 
our  comparison  to  those  equations  from  which  the  direction  of 
vibration  has  been  eliminated,  and  which  therefore  represent  relations 
subject  to  experimental  control.  For  this  purpose  equation  (13)  on 
page  299  is  suitable.     It  reads 

u*  v*  v? 

<T„.       II.  —  Q 

ti|— n*    tij — n*    n\— n*      ' 

n*>  ihfi  n*  being  *ne  principal  indices  of  refraction.  This  the  author 
calls  the  equation  of  the  wave-surface  or  surface  of  ray-velocities. 
It  has  the  form  of  the  equation  of  Fresnel's  wave-surface,  expressed 
in  terms  of  the  direction-cosines  and  reciprocal  of  the  radius  vector, 
and  if  u,  v,  w  are  the  direction-cosines  of  the  ray,  and  n  the  velocity 
of  light  in  vacuo  divided  by  the  so-called  ray-velocity  in  the  crystal 
the  equation  will  express  Fresnel's  law.  But  it  is  impossible  to  give 
these  meanings  to  u,  v,  w  and  n.  They  are  introduced  into  the 
discussion  in  the  expression  for  the  vibrations  (p.  295),  viz., 


,=*coe2^-2^±|2±^>). 


The  form  of  this  equation  shows  that  u,  vt  w  are  proportional  to  the 
direction-cosines  of  the  wave-normal,  and  as  the  relation  u*+ v*+w*=  1 
is  afterwards  used,  they  must  be  the  direction-cosines  of  the  wave- 
normal.  They  cannot  possibly  denote  the  direction-cosines  of  the  ray, 
except  in  the  particular  case  in  which  the  ray  and  wave-normal 
coincide.  Again,  from  the  form  of  this  equation,  \/n  must  be  the 
wave-length  in  the  crystal,  and  if  \  here  as  elsewhere  in  the  book 
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(see  p.  25)  denotes  the  wave-length  vn  vacuo  of  light  of  the  period 
considered,  which  we  doubt  not  is  the  intention  of  the  author,  n  must 
be  the  wave-length  in  vacuo  divided  by  the  wave-length  in  the 
crystal,  Le.,  the  velocity  of  light  in  vacuo  divided  by  the  wave-velocity 
in  the  crystal  With  these  definitions  of  u,  v,  w,  and  n,  equation  (13) 
expresses  a  law  which  is  different  from  FresneTs.  Applied  to  the 
simple  case  of  a  uniaxial  crystal,  it  makes  the  relation  between  the 
wave-velocity  of  the  extraordinary  ray  and  the  angle  of  the  wave- 
normal  with  the  principal  axis  the  same  as  that  of  the  radius  vector 
and  the  angle  in  an  ellipse.  The  law  of  Huyghens  and  Fresnel  makes 
the  reciprocal  of  the  wave- velocity  stand  in  this  relation. 

The  law  which  our  author  has  deduced  has  come  up  again  and 
again  in  the  history  of  theoretical  optica  Professor  Stokes  (Report 
of  the  British  Assoc.,  1862,  part  i,  p.  269)  and  Lord  Rayleigh  (Phil. 
Mag.,  (4),  vol.  xli,  p.  525)  have  both  raised  the  question  whether 
Huyghens  and  Fresnel  might  not  have  been  wrong,  and  it  might  not 
be  the  wave- velocity  and  not  its  reciprocal  which  is  represented  by 
the  radius  vector  in  an  ellipse.  The  difference  is  not  very  great,  for 
if  we  lay  off  on  the  radii  vectores  of  an  ellipse  distances  inversely 
proportional  to  their  lengths,  the  resultant  figure  will  have  an  oval 
form  approaching  that  of  an  ellipse  when  the  eccentricity'  of  the 
original  ellipse  is  small  Rankine  appears  to  have  thought  that  the 
difference  might  be  neglected  (see  Phil.  Mag.,  (4),  voL  i,  pp.  444,  445) 
at  least  he  claims  that  his  theory  leads  to  Fresnel's  law,  while  really 
it  would  give  the  same  law  which  our  author  has  found.  (Concerning 
Rankine's  "  splendid  failure/'  and  the  whole  history  of  the  subject,  see 
Sir  Wm.  Thomson's  Lectures  on  Molecular  Dynamics  at  the  Johns 
Hopkins  University ,  chap,  xx.)  Professor  Stokes  undertook  experi- 
ments to  decide  the  question.  His  result,  corroborated  by  Glazebrook 
(Pro.  Roy.  8oc.t  vol  xx,  p.  443;  PhU.  Trans.,  vol  clxxi,  p.  421),  was 
that  Huyghens  and  Fresnel  were  right  and  that  the  other  law  was 
wrong. 

To  return  to  our  author;  we  have  no  doubt  from  the  context  that 
he  regards  u,  v,  w,  and  n  as  relating  to  the  ray  and  not  to  the  wave- 
normal  We  suppose  that  that  is  the  meaning  of  his  remark  that 
the  expression  for  the  vibrations  (quoted  above)  is  to  be  referred  to 
the  direction  of  the  ray.  It  seems  rather  hqrd  not  to  allow  a  writer 
the  privilege  of  defining  his  own  terms.  Tet  the  reader  will  admit 
that  when  the  vibrations  have  been  expressed  in  the  above  form  an 
inexorable  necessity  fixes  the  significance  of  the  direction  determined 
by  u,  v,  w,  and  leaves  nothing  in  that  respect  to  the  choice  of  the 
author. 

The  historical  sketches  of  the  development  of  ideas  in  the  theory 
of  optics,  enriched  by  very  numerous  references,  will  be  useful  to 
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the  student.  An  exception,  however,  must  be  made  with  respect  to 
the  statements  concerning  the  electromagnetic  theory  of  light  We 
are  told  (p.  450)  that  the  English  theory,  founded  by  Maxwell  and 
represented  by  Glazebrook  and  Fitzgerald,  makes  the  plane  of  polari- 
zation coincide  with  the  plane  of  vibration,  while  Lorentz,  on  the 
basis  of  Helmholtz's  equations  comes  to  the  conclusion  that  these 
planes  are  at  right  angles.  Since  all  these  writers  make  the  electrical 
displacement  perpendicular  to  the  plane  of  polarization,  we  can  only 
attribute  this  statement  to  some  confusion  between  the  electrical 
displacement  and  the  magnetic  force  or  "  displacement "  at  right  angles 
to  it.  We  are  also  told  that  Glazebrook's  "  surface-conditions  "  which 
determine  the  intensity  of  reflected  and  refracted  light  are  different 
from  those  of  Lorentz, — a  singular  error  in  view  of  the  fact  that 
Mr.  Glazebrook  (Proc.  Carrib.  Phil.  Soc.t  vol.  iv,  p.  166)  expressly 
states  that  his  results  are  the  same  as  those  of  Lorentz,  Fitzgerald, 
and  J.  J.  Thomson.  We  have  spent  much  fruitless  labor  in  trying  to 
discover  where  and  how  the  expressions  were  obtained  which  are 
attributed  to  Glazebrook,  but  in  which  the  notation  has  been  altered. 
They  ought  to  come  from  Glazebrook's  equations  (24)-(27)  (loc.  cit.), 
but  these  appear  identical  with  Lorentz's  equations  (58)-(61)  (Zeit- 
schriftf.  Math.  u.  Phys.,  vol  xxii,  p.  27).  They  might  be  obtained  by 
interchanging  the  expressions  for  vibrations  in  the  plane  of  incidence 
and  at  right  angles  to  it,  with  two  changes  of  sign. 

The  reader  must  be  especially  cautioned  concerning  the  statements 
and  implications  of  what  has  not  been  done  in  the  electromagnetic 
theory.  These  are  such  as  to  suggest  the  question  whether  the  author 
has  taken  the  trouble  to  read  the  titles  of  the  papers  which  have 
been  published.  We  refer  especially  to  what  is  said  on  pages  248, 
249  concerning  absorption,  dispersion,  and  the  magnetic  rotation  of 
the  plane  of  polarization. 

In  the  Experimental  Part,  with  which  the  treatise  closes,  we  have 
a  comparison  of  formula  with  the  results  of  experiments  by  the 
author  and  others.  The  author  has  been  particularly  successful  in 
the  formula  for  dispersion.  In  the  case  of  quartz  (p.  545),  the 
formula  (with  four  constants)  represents  the  results  of  experiment  in 
a  manner  entirely  satisfactory  through  the  entire  range  of  wave- 
length from  2*14  to  0*214.  Those  who  may  not  agree  with  the 
author's  theoretical  views  will  nevertheless  be  glad  to  see  the  results 
of  experiment  brought  together,  and,  so  far  as  may  be,  represented  by 
formulae. 
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XVII. 

ON  THE  VELOCITY  OF  LIGHT  AS  DETERMINED  BY 
FOUCAULT'S  REVOLVING  MIRROR 

[Natute,  vol.  xxxm.  p.  582,  April  22,  1886.] 

It  has  been  shown  by  Lord  Rayleigh  and  others  that  the  velocity 
(U)  with  which  a  group  of  waves  is  propagated  in  any  medium  may 
be  calculated  by  the  formula 

where  V  is  the  wave-velocity,  and  X  the  wave-length.  It  has  also 
been  observed  by  Lord  Rayleigh  that  the  fronts  of  the  waves  reflected 
by  the  revolving  mirror  in  Foucault's  experiment  are  inclined  one  to 
another,  and  in  consequence  must  rotate  with  an  angular  velocity 

dV 
d\a> 
where  a  is  the  angle  between  two  successive  wave-planes  of  similar 
phase.     When  dV/d\  is  positive  (the  usual  case),  the  direction  of 
rotation  is  such  that  the  following  wave-plane  rotates  towards  the 
position  of  the  preceding  (see  Nature,  voL  xxv.  p.  52). 

But  I  am  not  aware  that  attention  has  been  called  to  the  important 
fact,  that  while  the  individual  wave  rotates  the  wave-normal  of  the 
group  remains  unchanged,  or,  in  other  words,  that  if  we  fix  our 
attention  on  a  point  moving  with  the  group,  therefore  with  the 
velocity  U,  the  successive  wave-planes,  as  they  pass  through  that 
point,  have  all  the  same  orientation.  This  follows  immediately  from 
the  two  formulae  quoted  above.  For  the  interval  of  time  between 
the  arrival  of  two  successive  wave-planes  of  similar  phase  at  the 
moving  point  is  evidently  \/(  F—  U)t  which  reduces  by  the  first  for- 
mula to  dkjdV.  In  this  time  the  second  of  the  wave-planes,  having 
the  angular  velocity  adV/d\t  will  rotate  through  an  angle  a  towards 
the  position  of  the  first  wave-plane.  But  a  is  the  angle  between  the 
two  planes.  The  second  plane,  therefore,  in  passing  the  moving  point, 
will  have  exactly  the  same  orientation  which  the  first  had.  To  get  a 
picture  of  the  phenomenon,  we  may  imagine  that  we  are  able  to  see 
a  few  inches  of  the  top  of  a  moving  carriage-wheel  The  individual 
spokes  rotate,  while  the  group  maintains  a  vertical  direction. 
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This  consideration  greatly  simplifies  the  theory  of  Foucault's 
experiment,  and  makes  it  evident,  I  think,  that  the  results  of  all 
such  experiments  depend  upon  the  value  of  U,  and  not  upon  that  of  V. 

The  discussion  of  the  experiment  by  following  a  single  wave,  and 
taking  account  of  its  rotation,  is  a  complicated  process,  and  one  in 
which  it  is  very  easy  to  leave  out  of  account  some  of  the  elements  of 
the  problem.  The  principal  objection  to  it,  however,  is  its  unreality. 
If  the  dispersion  is  considerable,  no  wave  which  leaves  the  revolving 
mirror  will  return  to  it.  The  individual  disappears,  only  the  group 
has  permanence.  Prof.  Schuster  in  his  communication  of  March  11 
(p.  430),  has  nevertheless  obtained  by  this  method,  as  the  quantity 
determined  by  "the  experiments  hitherto  performed,"  F"/(2F—  U), 
which,  as  he  observes,  is  nearly  equal  to  U.  He  would,  I  think, 
have  obtained  U  precisely,  if  for  the  angle  between  two  successive 
wave-planes  of  similar  phase,  instead  of  2w\/Vt  he  had  used  the  more 
exact  value  2w\/U, 

By  the  kindness  of  Prof.  Michelson,  I  am  informed  with  respect  to 
his  recent  experiments  on  the  velocity  of  light  in  bisulphide  of  carbon 
that  he  would  be  inclined  to  place  the  maximum  brilliancy  of  the 
light  between  the  spectral  lines  D  and  E,  but  nearer  to  D.  If  we 
take  the  mean  between  D  and  E,  we  have 


*_!**       ^£^>=l-737, 


K  denoting  the  velocity  in  vacuo  (see  p.  249  of  this  volume).  The 
number  observed  was  1*76,  "  with  an  uncertainty  of  two  units  in  the 
second  place  of  decimals."  This  agrees  best  with  the  first  formula. 
The  same  would  be  true  if  we  used  values  nearer  to  the  line  D. 

J.  WlLLABD  GlBBS. 

New  Haven,  Connecticut,  April  1.    [1886.] 
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VELOCITY  OF  PROPAGATION  OF  ELECTROSTATIC  FORCE. 

[Nature,  vol  Lni.  p.  609,  ^pril  2,  1896.] 

As  we  may  have  to  wait  some  time  for  the  experimental  solution 
of  Lord  Kelvin's  very  instructive  and  suggestive  problem  concerning 
two  pairs  of  spheres  charged  with  electricity  (see  Nature  of  February 
6,  p.  316),  it  may  be  interesting  to  see  what  the  solution  would  be 
from  the  standpoint  of  existing  electrical  theories. 

In  applying  Maxwell's  theory  to  the  problem  it  will  be  convenient 
to  suppose  the  dimensions  of  both  pairs  of  spheres  very  small  in 
comparison  with  the  unit  of  length,  and  the  distance  between  the 
two  pairs  very  great  in  comparison  with  the  same  unit  These 
conditions,  which  greatly  simplify  the  equations  which  represent  the 
phenomena,  will  hardly  be  regarded  as  affecting  the  essential  nature 
of  the  question  proposed. 

Let  us  first  consider  what  would  happen  on  the  discharge  of  (A,  B), 
if  the  system  (c,  d)  were  absent. 

Let  m0  be  the  initial  value  of  the  moment  of  the  charge  of  the 
system  (A,  B),  (this  term  being  used  in  a  sense  analogous  to  that 
in  which  we  speak  of  the  moment  of  a  magnet),  and  m  the  value 
of  the  moment  at  any  instant.    If  we  set 

m=F(0,  (1) 

and    suppose   the   discharge  to  commence   when  £  =  0,  and  to  be 
completed  when  <«A,  we  shall  have 

F(O=m0    when    *<0,  (2) 

and  F(*)  =  0      when    t>h,  (3) 

Let  us  set  the  origin  of  coordinates  at  the  centre  of  the  system 
(A,  B),  and  the  axis  of  x  in  the  direction  of  the  centre  of  the 
positively  charged  sphere.  A  unit  vector  in  this  direction  we  shall 
call  i,  Mid  the  vector  from  the  origin  to  the  point  considered  p. 
At  any  point  outside  of  a  sphere  of  unit  radius  about  the  origin, 
the  electrical  displacement  (£)  is  given  by  the  vector  equation 

4*© = [3r-«F(*  -  or)+3cr-4F'(*  -or)+cV-»F"(*  -  cr)]xp 

-[r-*F(t-cr)+cr'W(t-cr)+A-'1F''(t-cr)]i,        (4) 

where  F  denotes  the  function  determined  by  equation  (1),  F/  and  F" 
its  derivatives,  and  c  the  ratio  of  electrostatic  and  electromagnetic 
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units  of  electricity,  or  the  reciprocal  of  the  velocity  of  light  For 
this  satisfies  the  general  equation 

-V*X>=c*d*&ldt*t  (5) 

as  well  as  the  so-called  "  equation  of  continuity/'  and  also  satisfies 
the  special  conditions  that  when  t<0 

47rD=m0(3r-*aj/>  -  r'H) 
outaide  of  the  unit  sphere,  and  that  at  any  time  at  the  surface  of 
this  sphere  4x®=m(3a?/>-i), 

if  we  consider  the  terms  containing  the  factor  c  as  negligible,  when 
not  compensated  by  large  values  of  r.  That  equation  (4)  satisfies 
the  general  conditions  is  easily  verified,  if  we  set 

u=r-1F(«-cr),  (6) 

and  observe  that 

-V^-APtt/d*2,  (7) 

and  that  the  three  components  of  3)  are  given  by  the  equations 

4*/=  -d*u/dy*-<Puld** ' 

4Trg=d*uldxd.y  (8) 

4nrh=cPu/docdz 

Equation  (4)  shows  that  the  changes  of  the  electrical  displacement 
are  represented  by  three  systems  of  spherical  waves,  of  forms 
determined  by  the  rapidity  of  the  discharge  of  the  system  (A,  B), 
which  expand  with  the  velocity  of  light  with  amplitudes  diminishing 
as  r_a,  r~2,  and  r*1,  respectively.  Outside  of  these  waves,  the  electrical 
displacement  is  unchanged,  inside  of  them  it  is  zero. 

If  we  write  (with  Maxwell)  —d%/dt  for  the  force  of  electrodynamic 
induction  at  any  point,  and  suppose  its  rectangular  components 
calculated  from  those  of  —dflb/dt*  by  the  formula  used  in  calculating 
the  potential  of  a  mass  from  its  density,  we  shall  have  by  Poisson's 
theorem 

V\d%ldt)  =  tTrd*<Z>l<ti\ 

or  by  (5),  V2(d»/(ft)=  -4**r2V23>, 

whence  d^LJdt  =  -  4tc-*D.  (9) 

From  this,  with  (4),  and  the  general  equation 
^/^+4tc-2!D+VV  =  0> 
we  see  that  during  the  discharge  of  the  system  (A,  B)  the  electrostatic 
force  —  VV  vanishes  throughout  ail  space,  while  its  place  is  taken  by 
a  precisely  equal  electrodynamic  force  —d%ldL 

This  electrodynamic  force  remains  unchanged  at  every  point  until 
the  passage  of  the  waves,  after  which  the  electrostatic  force,  the 
electrodynamic  force,  and  the  displacement,  have  the  permanent  value 
zero. 
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If  we  write  Cturl  for  the  differentiating  vector  operator  which 
Maxwell  calls  by  that  name,  equations  (8)  may  be  put  in  the  form 

4ttD  =  Curl  Curl  (iu), 
whence  dfydt  =  (4*)-1  Curl  Curl  (i  du/dt). 

From  dfydt  we  may  calculate  the  magnetic  induction  9  by  an 
operation  which  is  the  inverse  of  (Inr)'1  Curl.     We  have  therefore 

%=Cxnl(idu,ldt)t 
or  %=[r-*F'(t-cr)+cr-*r'(t-cr)](yk-zft 

The  magnetic  induction  is  therefore  zero  except  in  the  waves. 

Equations  (4)  and  (9)  give  the  value  of  dtydt  as  function  of  t  and 
r.  By  integration,  we  may  find  the  value  of  U,  Maxwell's  "vector 
potential"  This  will  be  of  the  form  of  the  second  member  of  (4) 
multiplied  by  —  c~*,  if  we  should  give  each  F  one  accent  less,  and  for 
an  unaccented  F  should  write  F19  to  denote  the  primitive  of  F  which 
vanishes  for  the  argument  x . 

That  which  seems  most  worthy  of  notice  is  that  although  simul- 
taneously with  the  discharge  of  the  system  (A,  B)  the  values  of  what 
we  call  the  electric  potential,  the  electrodynamic  force  of  induction, 
and  the  "  vector  potential/'  are  changed  throughout  all  space,  this  does 
not  appear  connected  with  any  physical  change  outside  of  the  waves, 
which  advance  with  the  velocity  of  light 

If  we  now  suppose  that  there  is  a  second  pair  of  charged  spheres 
(c,  d),  as  in  the  original  problem,  the  discharge  of  this  pair  will 
evidently  occur  when  the  relaxation  of  electrical  displacement  reaches 
it.  The  time  between  the  discharges  is,  therefore,  by  Maxwell's 
theory,  the  time  required  for  light  to  pass  from  one  pair  to  the  other. 

It  may  also  be  interesting  to  observe  that  in  the  axis  of  xt  on  both 
sides  of  the  origin,  xp=rH,  and  equation  (4)  reduces  to 
4xJ) = [2r-»F(*  -  cr)+ 2cr-*F(*  -  cr)]i. 
Here,  therefore,  the  oscillations  are  normal  to  the  wave-surfaces. 
This  might  seem  to  imply  that  plane  waves  of  normal  oscillations 
may  be  propagated,  since  we  are  accustomed  to  regard  a  part  of  an 
infinite  sphere  as  equivalent  to  a  part  of  an  infinite  plane.  Of  course, 
such  a  result  would  be  contrary  to  Maxwell's  theory.  The  paradox 
is  explained  if  we  consider  that  the  parts  of  the  wave-motion, 
expressed  by  F  and  F,  diminish  more  rapidly  than  those  expressed  by 
F,  so  that  it  is  unsafe  to  take  the  displacements  in  the  axis  of  2  as 
approximately  representing  those  at  a  moderate  distance  from  it  In 
fact,  if  we  consider  the  displacements  not  merely  in  the  axis  of  x,  but 
within  a  cylinder  about  that  axis,  and  follow  the  waves  to  an  infinite 
distance  from  the  origin,  we  find  no  approximation  to  what  is  usually 
meant  by  plane  waves  with  normal  oscillations. 

J.  WlLLARD  QlBBS. 

New  Haven,  Conn.,  March  12  [1896]. 
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FOURIER'S  SERIES. 
[Nature,  vol.  ux,  p.  200,  Dec.  29,  1898.] 

I  should  like  to  add  a  few  words  concerning  the  subject  of  Prof. 
Michelson's  letter  in  Nature  of  October  6.  In  the  only  reply  which 
I  have  seen  (Nature,  October  13),  the  point  of  view  of  Prof.  Michelson 
is  hardly  considered. 

Let  us  write  /„(#)  for  the  sum  of  the  first  n  terms  of  the  series 

sin  a;— isin2g+}sin3a;—  £sin4g+etc. 

I  suppose  that  there  is  no  question  concerning  the  form  of  the  curve 
defined  by  any  equation  of  the  form 

Let  us  call  such  a  curve  Cn.  As  n  increases  without  limit,  the 
curve  approaches  a  limiting  form,  which  may  be  thus  described.  Let 
a  point  move  from  the  origin  in  a  straight  line  at  an  angle  of  45° 
with  the  axis  of  X  to  the  point  (tt,  tt),  thence  vertically  in  a  straight 
line  to  the  point  (tt,  —  x),  thence  obliquely  in  a  straight  line  to  the 
point  (Sir,  tc\  etc.  The  broken  line  thus  described  (continued 
indefinitely  forwards  and  backwards)  is  the  limiting  form  of  the 
curve  as  the  number  of  terms  increases  indefinitely.  That  is,  if  any 
small  distance  d  be  first  specified,  a  number  nf  may  be  then  specified, 
such  that  for  every  value  of  n  greater  than  n\  the  distance  of  any 
point  in  Cn  from  the  broken  line,  and  of  any  point  in  the  broken  line 
from  Cm  will  be  less  than  the  specified  distance  d. 

But  this  limiting  line  is  not  the  same  as  that  expressed  by  the 
equation  y= limit      2fn(x). 

The  vertical  portions  of  the  broken  line  described  above  are 
wanting  in  the  locus  expressed  by  this  equation,  except  the  points  in 
which  they  intersect  the  axis  of  X.  The  process  indicated  in  the 
last  equation  is  virtually  to  consider  the  intersections  of  Cn  with  fixed 
vertical  transversals,  and  seek  the  limiting  positions  when  n  is 
increased  without  limit.  It  is  not  surprising  that  this  proeess  does 
not  give  the  vertical  portions  of  the  limiting  curve.  If  we  should 
consider  the  intersections  of  Cn  with  horizontal  transversals,  and 
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seek  the  limits  which  they  approach  when  n  is  increased  indefinitely 
we  should  obtain  the  vertical  portions  of  the  limiting  curve  as  well  as 
the  oblique  portions. 

It  should  be  observed  that  if  we  take  the  equation 

and  proceed  to  the  limit  for  n  =  oo ,  we  do  not  necessarily  get  y»0  for 
x  =  tt.  We  may  get  that  ratio  by  first  setting  £«*-,  and  then  passing 
to  the  limit  We  may  also  get  y  =  l,  «=*■,  by  first  setting  y=l,  and 
then  passing  to  the  limit  Now  the  limit  represented  by  the  equation 
of  the  broken  line  described  above  is  not  a  special  or  partial  limit 
relating  solely  to  some  special  method  of  passing  to  the  limit,  but  it 
is  the  complete  limit  embracing  all  sets  of  values  of  x  and  y  which 
can  be  obtained  by  any  process  of  passing  to  the  limit. 

J.  WlLLABD  OlBBS. 

New  Haven,  Conn.,  November  29  [1898]. 


[Nature,  vol.  ux,  p.  606,  April  27,  1899.] 

I  should  like  to  correct  a  careless  error  which  I  made  {Natwre, 
December  29, 1898)  in  describing  the  limiting  form  of  the  family  of 
curves  represented  by  the  equation 

y  =  2fsina5— ^  sin  2a. ..±- sin  roe  J  (1) 

as  a  zigzag  line  consisting  of  alternate  inclined  and  vertical  portions. 
The  inclined  portions  were  correctly  given,  but  the  vertical  portions, 
which  are  bisected  by  the  axis  of  X,  extend  beyond  the  points  where 
they  meet  the  inclined  portions,  their  total  lengths  being  expressed 
by  four  times  the  definite  integral 

Jo    " 

If  we  call  this  combination  of  inclined  and  vertical  lines  C,  and 
the  graph  of  equation  (1)  C»,  and  if  any  finite  distance  d  be  specified, 
and  we  take  for  n  any  number  greater  than  100/cP,  the  distance  of 
every  point  in  Cn  from  C  is  less  than  d,  and  the  distance  of  every 
point  in  C  from  Cn  is  also  less  than  d.  We  may  therefore  call  C  the 
limit  (or  limiting  form)  of  the  sequence  of  curves  of  which  C„  is  the 
general  designation. 

But  this  limiting  form  of  the  graphs  of  the  functions  expressed 
by  the  sum  (1)  is  different  from  the  graph  of  the  function  expressed  by 
the  limit  of  that  sum.  In  the  latter  the  vertical  portions  are  wanting, 
except  their  middle  points. 
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I  think  this  distinction  important,  for  (with  exception  of  what 
relates  to  my  unfortunate  blonder  described  above)  whatever  differ- 
ences of  opinion  have  been  expressed  on  this  subject  seem  due,  for  the 
most  part,  to  the  fact  that  some  writers  have  had  in  mind  the  limit 
of  the  graphs,  and  others  the  graph  of  the  limit  of  the  sum.  A  mis- 
understanding on  this  point  is  a  natural  consequence  of  the  usage 
which  allows  us  to  omit  the  word  limit  in  certain  connections,  as 
when  we  speak  of  the  sum  of  an  infinite  series.  In  terms  thus 
abbreviated,  either  of  the  things  which  I  have  sought  to  distinguish  *! 

may  be  called  the  graph  of  the  sum  of  the  infinite  series.  | 

J.  WlLLARD  GIBBS. 

New  Haven,  April  12  [1899]. 
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RUDOLF  JULIUS  EMANUEL  CLAUSIUS. 
[Proceedings  of  the  American  Academy,  new  series,  vol  xvi,  pp.  458-465, 1889.] 

Rudolf  Julius  Emanuel  Clausius  was  born  at  Ctfslin  in  Pomer- 
ania,  January  2, 1822.  His  studies,  after  1840,  were  pursued  at  Berlin, 
where  he  became  Privat-docent  in  the  University,  and  Instructor  in 
Physics  in  the  School  of  Artillery.  He  was  Professor  of  Physics 
at  Zurich  in  the  Polytechnicum  (1855-67)  and  in  the  University 
(1857-67),  at  Wurzburg  (1867-69),  and  finally  at  Bonn  (1869-88), 
where  he  died  on  the  24th  of  August,  1888. 

His  literary  activity  commenced  in  1847,  with  the  publication  of 
a  memoir  in  OreUe's  Jowmal,  "Ueber  die  Iichtzerstreuung  in  der 
Atmosphare,  und  ttber  die  Intensitat  des  durch  die  Atmosph&re 
reflectirten  Sonnenlichts,"*  This  was  immediately  followed  by  other 
writings  relating  to  the  same  subject,  two  of  which  were  subsequently 
translated  from  PoggendorfjFs  AnnaZeni  for  Taylor's  Scientific 
Memoirs.  A  treatise  entitled  "Die  Iichterscheinungen  der  Atmo- 
sphare" formed  part  of  Grunert's  "Beitr&ge  zur  meteorologischen 
Optik." 

An  entirely  different  subject,  the  elasticity  of  solids,  was  discussed 
in  his  paper  (1849),  "  Ueber  die  Veranderungen,  welche  in  den  bisher 
gebrauchlichen  Formeln  fur  das  Gleichgewicht  und  die  Bewegung  fester 
Kttrper  durch  neuere  Beobachtungen  nothwendig  geworden  sincL"  \ 

But  it  was  with  questions  of  quite  another  order  of  magnitude  that 
his  name  was  destined  to  be  associated.  The  fundamental  questions 
concerning  the  relation  of  heat  to  mechanical  effect,  which  had  been 
raised  by  Rumford,  Carnot,  and  others,  to  meet  with  little  response, 
were  now  everywhere  pressing  to  the  front. 

"For  more  than  twelve  years,"  said  Regnault  in  1858,  "I  have 
been  engaged  in  collecting  the  materials  for  the  solution  of  this  ques- 
tion:— Given  a  certain  quantity  of  heat,  what  is,  theoretically,  the 
amount  of  mechanical  effect  which  can  be  obtained  by  applying  the 
heat  to  evaporation,  or  the  expansion  of  elastic  fluids,  in  the  various 
circumstances  which  can  be  realised  in  practice ?"§    The  twenty-first 

*  Vol  xxxiv,  p.  122,  arid  voL  xxxri,  p.  185.        +  VoL  lxxvi,  pp.  161  and  188. 

X  Pogg.  Awl,  voL  lxxvi,  p.  48  (1848).  §  Oomptes  Rendua,  voL  xxxri,  p.  876. 
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volume  of  the  Memoirs  of  the  Academy  of  Paris,  describing  the  first 
part  of  the  magnificent  series  of  researches  which  the  liberality  of  the 
French  government  enabled  him  to  carry  out  for  the  solution  of  this 
question,  was  published  in  1847.  In  the  same  year  appeared  Helm- 
holtz's  celebrated  memoir,  "Ueber  die  Erhaitung  der  Kraft."  For 
some  years  Joule  had  been  making  those  experiments  which  were 
to  associate  his  name  with  one  of  the  fundamental  laws  of  thermo- 
dynamics and  one  of  the  principal  constants  of  nature.  In  1849  he 
made  that  determination  of  the  mechanical  equivalent  of  heat  by  the 
stirring  of  water  which  for  nearly  thirty  years  remained  the  unques- 
tioned standard.  In  1848  and  1849  Sir  William  Thomson  was  engaged 
in  developing  the  consequences  of  Carnot's  theory  of  the  motive 
power  of  heat,  while  Professor  James  Thomson  in  demonstrating  the 
effect  of  pressure  on  the  freezing  point  of  water  by  a  Carnot's  cycle, 
showed  the  flexibility  and  the  fruitfulness  of  a  mode  of  demonstration 
which  was  to  become  canonical  in  thermodynamics.  Meantime 
Rankine  was  attacking  the  problem  in  his  own  way,  with  one  of 
those  marvellous  creations  of  the  imagination  of  which  it  is  so 
difficult  to  estimate  the  precise  value. 

Such  was  the  state  of  the  question  when  Clausius  published  his 
first  memoir  on  thermodynamics:  "Ueber  die  fcwegende  Kraft  der 
W&rme,  und  die  Gesetze,  welche  sich  daraus  fttr  die  Warmelehre  selbst 
ableiten  lassen."  * 

This  memoir  marks  an  epoch  in  the  history  of  physics.  If  we  say, 
in  the  words  used  by  Maxwell  some  years  ago,  that  thermodynamics 
is  "a  science  with  secure  foundations,  clear  definitions,  and  distinct 
boundaries,"  t  and  ask  when  those  foundations  were  laid,  those  defini- 
tions fixed,  and  those  boundaries  traced,  there  can  be  but  one  answer. 
Certainly  not  before  the  publication  of  that  memoir.  The  materials 
indeed  existed  for  such  a  science,  as  Clausius  showed  by  constructing 
it  from  such  materials,  substantially,  as  had  for  years  been  the  com- 
mon property  of  physicists.  But  truth  and  error  were  in  a  confusing 
state  of  mixture.  Neither  in  France,  nor  in  Germany,  nor  in  Great 
Britain,  can  we  find  the  answer  to  the  question  quoted  from  Regnault 
The  case  was  worse  than  this,  for  wrong  answers  were  confidently 
urged  by  the  highest  authorities.  That  question  was  completely 
answered,  on  its  theoretical  side,  in  the  memoir  of  Clausius,  and  the 
science  of  thermodynamics  came  into  existence.  And  as  Maxwell  said 
in  1878,  so  it  might  have  been  said  at  any  time  since  the  publication 
of  that  memoir,  that  the  foundations  of  the  science  were  secure,  its 
definitions  clear,  and  its  boundaries  distinct. 


*  Read  in  the  Berlin  Academy,  February  18,  1850,  and  published  in  the  March  and 
April  numbers  of  Poggendorft  Annalen. 
t  Nature,  vol.  xvii,  p.  257. 
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The  constructive  power  thus  exhibited,  this  ability  to  bring  order 
out  of  confusion,  this  breadth  of  view  which  could  apprehend  one 
truth  without  losing  sight  of  another,  this  nice  discrimination  to 
separate  truth  from  error,— these  are  qualities  which  place  the 
possessor  in  the  first  rank  of  scientific  men. 

In  the  development  of  the  various  consequences  of  the  funda- 
mental propositions  of  thermodynamics,  as  applied  to  all  kinds  of 
physical  phenomena,  Clausius  was  rivalled,  perhaps  surpassed,  in 
activity  and  versatility  by  Sir  William  Thomson.  His  attention, 
indeed,  seems  to  have  been  less  directed  toward  the  development  of 
the  subject  in  extension,  than  toward  the  nature  of  the  molecular 
phenomena  of  which  the  laws  of  thermodynamics  are  the  sensible 
expression.  He  seems  to  have  very  early  felt  the  conviction,  that 
behind  the  second  law  of  thermodynamics,  which  relates  to  the  heat 
absorbed  or  given  out  by  a  body,  and  therefore  capable  of  direct 
measurement,  there  was  another  law  of  similar  form  but  relating  to 
the  quantities  of  heat  (ie.,  molecular  vis  viva)  absorbed  in  the 
performance  of  work,  external  or  internal. 

This  may  be  made  more  definite,  if  we  express  the  second  law  in 
a  mathematical  form,  as  may  be  done  by  saying  that  in  any  reversible 
cyclic  process  which  a  body  may  undergo 


lf-o. 


where  dQ  is  an  elementary  portion  of  the  heat  imparted  to  the  body, 
and  t  the  absolute  temperature  of  the  body,  or  the  portion  of  it  which 
receives  the  heat  Or,  without  limitation  to  cyclic  processes,  we 
may  say  that  for  any  reversible  infinitesimal  change, 

dQ=tdSt 

where  S  denotes  a  certain  function  of  the  state  of  the  body,  called  by 
Clausius  the  entropy.  The  element  of  heat  may  evidently  be  divided 
into  two  parts,  of  which  one  represents  the  increase  of  molecular 
vie  viva  in  the  body,  and  the  other  the  work  done  against  forces, 
either  external  or  internal.     If  we  call  these  parts  dH  and  dQw,  we 

have  dQ=dH+dQw. 

Now  the  proposition  of  which  Clausius  felt  so  strong  a  conviction 
was  that  for  reversible  cyclic  processes 


jt-». 


and  that  for  any  reversible  infinitesimal  change 

dQw=tdZ, 
where  Z  is  another  function  of  the  state  of  the  body,  which  he 
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called  the  ctisgregation,  and  regarded  as  determined  by  the  positions 
of  the  elementary  parts  of  the  body  without  reference  to  their  veloci- 
ties. In  this  respect  it  differed  from  the  entropy.  An  immediate 
consequence  of  these  relations  is  that  for  any  reversible  cyclic  process 

and  therefore  that  H,  the  molecular  vis  viva  of  the  body,  must  be 
a  function  of  the  temperature  alone.  This  important  result  was 
expressed  by  Clausiua  in  the  following  words :  "  Die  Menge  der  in 
einem  Korper  wirklich  vorhandenen  Warme  ist  nur  von  seiner 
Temperatur  und  nicht  von  der  Anordnung  seiner  Bestandtheile 
abhangig." 

To  return  to  the  equation 

dQw=tdZ. 

This  expresses  that  heat  tends  to  increase  the  disgregation,  and  that 
the  intensity  of  this  tendency  is  proportional  to  the  absolute  tempera- 
ture. In  the  words  of  Clausius:  "Die  mechanische  Arbeit,  welche 
die  Warme  bei  irgend  einer  Anordnungsanderung  ernes  Korpers  thun 
kann,  ist  proportional  der  absoluten  Temperatur,  bei  welcher  die 
Aenderung  geschieht." 

Such  in  brief  and  in  part  were  the  views  advanced  by  Clausius  in 
1862,  in  his  memoir,  "Ueber  die  Anwendung  des  Satzes  von  der 
Aequivalenz  der  Verwandlungen  auf  die  innere  Arbeit."*  Although 
they  were  advanced  rather  as  a  hypothesis  than  as  anything  for 
which  he  could  give  a  formal  proof,  he  seems  to  have  little  doubt  of 
their  correctness,  and  his  confidence  seems  to  have  increased  with  the 
course  of  time. 

The  substantial  correctness  of  these  views  cannot  now  be  called  in 
question.  The  researches  especially  of  Maxwell  and  Boltzmann  have 
shown  that  the  molecular  vis  viva  is  proportional  to  the  absolute 
temperature,  and  Boltzmann  has  even  been  able  to  determine  the 
precise  nature  of  the  functions  which  Clausius  called  entropy  and 
disgregatioat  But  the  anticipation,  to  a  certain  extent,  at  so  early  a 
period  in  the  history  of  the  subject,  of  the  ultimate  form  which  the 
theory  was  to  take,  shows  a  remarkable  insight,  which  is  by  no 
means  to  be  lightly  esteemed  on  account  of  the  acknowledged  want  of 
a  rigorous  demonstration.  The  propositions,  indeed,  as  relating  to 
quantities  which  escape  direct  measurement,  belong  to  molecular 
science,  and  seem  to  require  for  their  complete  and  satisfactory 
demonstration  a  considerable  development   of  that   science.     This 


*  Pogg.  Ann.,  vol.  oxvi,  p.  73.     See  also  vol.  oxxvii,  p.  477  (1866). 
t Sitzungsberichlt  Wien.  Akad.,  vol.  lxiii,  p.  728  (1871). 
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development  naturally  commenced  with  the  simplest  case  involving 
the  characteristic  problems  of  the  subject, — the  case,  namely,  of  gases. 

The  origin  of  the  kinetic  theory  of  gases  is  lost  in  remote  antiquity, 
and  its  completion  the  most  sanguine  cannot  hope  to  see.  But  a 
single  generation  has  seen  it  advance  from  the  stage  of  vague  sur- 
mises to  an  extensive  and  well  established  body  of  doctrine.  This  is 
mainly  the  work  of  three  men,  Clausius,  Maxwell,  and  Boltzmann,  of 
whom  Clausius  was  the  earliest  in  the  field,  and  has  been  called  by 
Maxwell  the  principal  founder  of  the  science.*  We  may  regard  his 
paper  (1857),  "Ueber  die  Art  der  Bewegung,  welche  wir  Warme 
nennen,"t  as  marking  his  definite  entrance  into  this  field,  although 
many  points  were  incidentally  discussed  in  earlier  papers. 

This  was  soon  followed  by  his  papers, "  Ueber  die  mittlere  Lange 
der  Wege,  welche  bei  der  Molecularbewegung  gasf&rmiger  Ktfrper 
von  den  einzelnen  Moleclilen  zurtickgelegt  werden,"J  and  "Ueber  die 
Warmeleitung  gasfttrmiger  Korper."§ 

A  very  valuable  contribution  to  molecular  science  is  the  conception 
of  the  virial,  defined  in  his  paper  (1870),  "Ueber  einen  auf  die  Warme 
anwendbaren  Satz,"||  where  he  shows  that  in  any  case  of  stationary 
motion  the  mean  vis  viva  of  the  system  is  equal  to  its  virial. 

In  the  mean  time,  Maxwell  and  Boltzmann  had  entered  the  field. 
Maxwell's  first  paper,  "On  the  Motions  and  Collisions  of  perfectly 
elastic  Spheres,"  IT  was  characterized  by  a  new  manner  of  proposing 
the  problems  of  molecular  science.  Clausius  was  concerned  with  the 
mean  values  of  various  quantities  which  vary  enormously  in  the 
smallest  time  or  space  which  we  can  appreciate.  Maxwell  occupied 
himself  with  the  relative  frequency  of  the  various  values  which  these 
quantities  have.  In  this  he  was  followed  by  Boltzmann.  In  reading 
Clausius,  we  seem  to  be  reading  mechanics ;  in  reading  Maxwell,  and 
in  much  of  Boltzmann's  most  valuable  work,  we  seem  rather  to  be 
reading  in  the  theory  of  probabilities.  There  is  no  doubt  that  the 
larger  manner  in  which  Maxwell  and  Boltzmann  proposed  the 
problems  of  molecular  science  enabled  them  in  some  cases  to  get 
a  more  satisfactory  and  complete  answer,  even  for  those  questions 
which  do  not  at  first  sight  seem  to  require  so  broad  a  treatment. 

Boltzmann's  first  work,  however  (1866),  "Ueber  die  mechanische 
Bedeutung  des  zweiten  Hauptsatzes  der  Warmetheorie"**  was  in  a 
line  in  which  no  one  had  preceded  him,  although  he  was  followed  by 

*  Nature,  vol.  xvii,  p.  278. 

iPogg.  Ami.,  vol.  c,  p.  368  (1867). 

trbid.,  vol  ov,  p.  239  (1858).     See  also  Wied.  Arm.,  vol.  x,  p.  92. 

§  JWrf.,  voL  oxv,  p.  1  (1802). 

||  Ibid.,  vol.  cxli,  p.  124.     See  also  Jvbelband,  p.  411. 

VPhU.  Mag.,  vol.  xix,  p.  19  (1860). 

**  Sitzwigsberichie  Wien.  Akad.  vol.  liii,  p.  195. 


Digitized  by  VjOOQIC 


266  RUDOLF  JULIUS  EMANUEL  CLAUSIUa 

some  of  the  most  distinguished  names  among  his  contemporaries. 
Somewhat  later  (1870)  Clausius,  whose  attention  had  not  been  called 
to  Boltzmann's  work,  wrote  his  paper,  "Ueber  die  Zuruekfuhrung  des 
zweiten  Hauptsatzes  der  mechanischen  W&rmetheorie  auf  allgemeine 
mechanische  Priricipien."* 

The  point  of  departure  of  these  investigations,  and  others  to  which 
they  gave  rise,  is  the  consideration  of  the  mean  values  of  the  force- 
function  and  of  the  via  viva  of  a  system  in  which  the  motions  are 
periodic,  and  of  the  variations  of  these  mean  values  when  the  external 
influences  are  changed.  The  theorems  developed  belong  to  the  same 
general  category  as  the  principle  of  least  action,  and  the  principle 
or  principles  known  as  Hamilton's,  which  have  to  do,  explicitly  or 
implicitly,  with  the  variations  of  these  mean  values. 

Among  other  papers  of  Clausius  on  this  subject,  we  may  mention 
the  two  following:  " Ueber  einen  neuen  mechanischen  Satz  in 
Bezug  auf  station&re  Bewegung"t  (1873),  and  "Ueber  den  Satz  vom 
mittleren  Ergal  und  seine  Anwendung  auf  die  Molecularbewegungen 
der  GaseMt  (1874). 

The  first  problem  of  molecular  science  is  to  derive  from  the  observed 
properties  of  bodies  as  accurate  a  notion  as  possible  of  their  molecular 
constitution.  The  knowledge  we  may  gain  of  their  molecular  con- 
stitution may  then  be  utilized  in  the  search  for  formulas  to  represent 
their  observable  properties.  A  most  notable  achievement  in  this 
direction  is  that  of  van  der  Waals,  in  his  celebrated  memoir,  "  On  the 
Continuity  of  the  Gaseous  and  Liquid  States."  To  this  part  of  the 
subject  belong  the  following  papers  of  Clausius:  "Ueber  das  Verhalten 
der  Kohlens&ure  in  Bezug  auf  Druck,  Volumen  und  Temperatur,"§ 
and  "  Ueber  die  theoretische  Bestimmung  des  Dampf  druckes  und  der 
Volumina  des  Dampf es  und  der  Flttssigkeit"  (two  papers).  || 

Another  matter  in  which  Clausius  showed  his  originality  and 
power  was  the  vexed  subject  of  electrodynamics,  as  treated  in  his 
memoir,  "Ueber  die  Ableitung  eines  neuen  electrodynamischen 
Grundgesetzes."  1T  Various  points  in  the  theory  of  electricity  in 
which  the  principles  of  thermodynamics  or  of  molecular  science 
were  involved,  had  previously  been  treated  in  different  papers,  of 
which  the  earliest  appeared  in  1852,**  while  the  doctrine  of  the 


*  Pogg.  Ami.%  vol.  oxlii,  pi  433. 

ilbid.y  vol.  el,  p.  106. 

%IbuLt  Krg&nzttngsband  vii,  p.  215. 

§  Wied.  Ann.,  vol.  U,  p.  337  (1880). 

||  Ibid.,  vol.  xiv,  p.  279  and  p.  692  (1881). 

U  CrdU's  Journal,  vol.  lxxxii,  p.  86  (1877). 

**  "  Ueber  das  mechanische  Aequivalent  einer  electrischen  Entladung  und  die  dabei 
stattfindende  Erwarmung  des  Leitnngsdrahtes."  Pogg.  Ann,,  vol.  lxxxvi,  p.  337. 
44  Ueber  die  bei  einem  stationaren  electrischen  Strome  in  dem  Leiter  gethane  Arbeit 
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potential  (electrical  and  gravitational)  was  treated  in  a  separate  book, 
which  appeared  in  1859,  with  the  title,  "Die  Potentialfunction  and 
das  Potential,  ein  Beitrag  zur  mathematischen  Physik."  This 
subsequently  went  through  several  editions,  in  which  it  was  revised 
and  enlarged.  All  these  subjects,  with  others,  were  brought  together 
in  a  single  volume,  "  Die  mechanische  Behandlung  der  Mectricitat/' 
which  appeared  in  1879,  forming  the  second  volume  of  his 
"Mechanische  Warmetheorie." *  Later  papers  on  electricity  related 
to  the  principles  of  electrodynamics,  t  electrical  and  magnetic  units,  J. 
and  dynamo-electric  machines^ 

The  Royal  Society's  catalogue  of  scientific  papers,  and  the  excellent 
indices  to  the  Annalen  der  Physik  und  Chemiey  in  which  Clausius's 
work  usually  appeared,,  render  it  unnecessary  to  enumerate  in  detail 
his  scientific  papers.  The  list,  indeed,  would  be  a  long  one.  The 
Royal  Society's  catalogue  gives  seventy-seven  titles  for  the  years 
1847-1873.  Subsequently  twenty-five  papers  have  appeared  in  the 
Annalen  alone,  and  about  half  as  many  others  elsewhere. 

But  such  work  as  that  of  Clausius  is  not  measured  by  counting 
titles  or  pages.  His  true  monument  lies  not  on  the  shelves  of 
libraries,  but  in  the  thoughts  of  men,  and  in  the  history  of  more 
than  one  science. 


and  erzeugte  Warme."  Pogg.  Aim,,  voL  lxxxvii,  p.  415  (1862).  "Ueber  die  An- 
wendung  der  meohanisohen  Warmetheorie  auf  die  thennoeleotrisohen  Krsoheinungen." 
Pogg.  An*.,  vol.  xo,  p.  513  (1858).  "  Ueber  die  Eleotrimtttsleitung  in  Eleotrolyten." 
Pogg.  Atm.,  vol  oi,  p.  838  (1857). 

*  The  first  volume  of  this  work  appeared  in  1876,  and  oontained  the  general  theory 
with  the  more  immediate  oonseQuenoes  of  the  two  fundamental  laws.  The  third  volume 
has  not  yet  appeared,  but  it  is  expected  very  soon,  edited  by  Professor  Planck  and 
Dr.  Pulf  rich.  In  a  certain  sense  this  work  may  be  regarded  as  a  second  edition  of  an 
earner  one  (1864  and  1867),  which  consisted  of  a  reprint  of  papers  and  had  the  title 
44  Abhandlungen  fiber  die  meohanisohe  Warmetheorie." 

t  Wied.  Arm.,  voL  x,  p.  606 ;  voL  xi,  p.  604. 

%Ibid.t  vol.  xvi,  p.  529 ;  voL  xvii,  p.  713. 

| Ibid.,  vol.  xx,  p.  353 ;  vol  xxi,  p.  385. 
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HUBERT  ANSON  NEWTON. 

[American  Journal  of  Science,  ser.  4,  vol.  in,  pp.  359-376,  May  1897.] 

(Bead  before  the  National  Academy  of  Sciences,  in  April,  1897.) 

Hubert  Anson  Newton  was  bom  on  March  19th,  1830,  at 
Sherburne,  N.Y.,  and  died  at  New  Haven,  Conn.,  on  the  12th  day  of 
August,  1896.  He  was  the  fifth  son  of  a  family  of  seven  sons  and 
four  daughters,  children  of  William  and  Lois  (Butler)  Newton.  The 
parents  traced  their  ancestory  back  to  the  first  settlers  of  Massa- 
chusetts and  Connecticut*  and  had  migrated  from  the  latter  to 
Sherburne,  when  many  parts  of  central  New  York  were  still  a 
wilderness.  They  both  belonged  to  families  remarkable  for  longevity, 
and  lived  themselves  to  the  ages  of  ninety-three  and  ninety-four 
years.  Of  the  children,  all  the  sons  and  two  daughters  were  living  as 
recently  as  the  year  1889,  the  youngest  being  then  fifty-three  years 
of  age.  William  Newton  was  a  man  of  considerable  enterprise,  and 
undertook  the  construction  of  the  Buffalo  section  of  the  Erie  canal, 
as  well  as  other  work  in  canal  and  railroad  construction  in  New 
York  and  Pennsylvania.  In  these  constructions  he  is  said  to  have 
relied  on  his  native  abilities  to  think  out  for  himself  the  solution 
of  problems  which  are  generally  a  matter  of  technical  training.  His 
wife  was  remarkable  for  great  strength  of  character  united  with  a 
quiet  temperament  and  well-balanced  mind,  and  was  noted  among 
her  neighbors  for  her  mathematical  powers. 

Young  Newton,  whose  mental  endowments  were  thus  evidently 
inherited,  and  whose  controlling  tastes  were  manifested  at  a  very 
early  age,  fitted  for  college  at  the  schools  of  Sherburne,  and  at  the 
age  of  sixteen  entered  Yale  College  in  the  class  graduating  in  1850. 
After  graduation  he  pursued  his  mathematical  studies  at  New  Haven 
and  at  home,  and  became  tutor  at  Yale  in  January,  1853,  when  on 
account  of  the  sickness  and  death  of  Professor  Stanley  the  whole 
charge  of  the  mathematical  department  devolved  on  him  from  the  first. 

*  Richard  Butler,  the  great-grandfather  of  Lois  Butler,  came  over  from  England 
before  1633,  and  was  one  of  those  who  removed  from  Cambridge  to  Hartford.  An 
ancestor  of  William  Newton  came  directly  from  England  to  the  New  Haven  colony 
about  the  middle  of  the  same  century. 
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In  1855,  he  was  appointed  professor  of  mathematics  at  the  early 
age  of  twenty-five.  This  appointment  testifies  to  the  confidence 
which  was  felt  in  his  abilities,  and  is  almost  the  only  instance  in 
which  the  Yale  Corporation  has  conferred  the  dignity  of  a  full 
professorship  on  so  young  a  man. 

This  appointment  being  accompanied  with  a  leave  of  absence  for  a 
year,  in  order  to  give  him  the  opportunity  to  study  in  Europe,  it  was 
but  natural  that  he  should  be  attracted  to  Paris,  where  Chasles  was 
expounding  at  the  Sorbonne  that  modern  higher  geometry  of  which 
he  was  to  so  large  an  extent  the  creator,  and  which  appeals  so 
strongly  to  the  sense  of  the  beautiful.  And  it  was  inevitable  that 
the  student  should  be  profoundly  impressed  by  the  genius  of  his 
teacher  and  by  the  f ruitfulneas  and  elegance  of  the  methods  which  he 
was  introducing.  The  effect  of  this  year's  study  under  the  inspiring 
influence  of  such  a  master  is  seen  in  several  contributions  to  the 
Mathematical  Monthly  during  its  brief  existence  in  the  years  1858-61. 
One  of  these  was  a  problem  which  attracted  at  once  the  attention 
of  Cayley,  who  sent  a  solution.  Another  was  a  discussion  of  the 
problem  "  to  draw  a  circle  tangent  to  three  given  circles,"  remarkable 
for  his  use  of  the  principle  of  inversion.  A  third  was  a  very 
elaborate  memoir  on  the  construction  of  curves  by  the  straight  edge 
and  compasses,  and  by  the  straight  edge  alone.  These  early  essays  in 
geometry  show  a  mind  thoroughly  imbued  with  the  spirit  of  modern 
geometry,  skilful  in  the  use  of  its  methods,  and  eager  to  extend  the 
bounds  of  our  knowledge. 

Nevertheless,  although  for  many  years  the  higher  geometry  was 
with  him  a  favorite  subject  of  instruction  for  his  more  advanced 
students,  either  his  own  preferences,  or  perhaps  rather  the  influence 
of  his  environment,  was  destined  to  lead  him  into  a  very  different 
field  of  research.  In  the  attention  which  has  been  paid  to  astronomy 
in  this  country  we  may  recognize  the  history  of  the  world  repeating 
itself  in  a  new  country  in  respect  to  the  order  of  the  development  of 
the  sciences,  or  it  may  be  enough  to  say  that  the  questions  which 
nature  forces  on  us  are  likely  to  get  more  attention  in  a  new  country 
and  a  bustling  age,  than  those  which  a  reflective  mind  puts  to  itself, 
and  that  the  love  of  abstract  truth  which  prompts  to  the  construction 
of  a  system  of  doctrine,  and  the  refined  taste  which  is  a  critic  of 
methods  of  demonstration,  are  matters  of  slow  growth.  At  all  events, 
when  Professor  Newton  was  entering  upon  his  professorship,  the 
study  of  the  higher  geometry  was  less  consonant  with  the  spirit  of 
the  age  in  this  country  than  the  pursuit  of  astronomical  knowledge, 
and  the  latter  sphere  of  activity  soon  engrossed  his  best  efforts. 

Yet  it  was  not  in  any  of  the  beaten  paths  of  astronomers  that 
Professor  Newton  was  to  move.    It  was  rather  in  the  wilds  of  a 
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terra  incognita,  which  astronomers  had  hardly  troubled  themselves  to 
claim  as  belonging  to  their  domain,  that  he  first  labored  to  establish 
law  and  order.  It  was  doubtless  not  by  chance  that  he  turned  his 
attention  to  the  subject  of  shooting  stars.  The  interest  awakened  in 
this  country  by  the  stupendous  spectacle  of  1833,  which  was  not  seen 
in  Europe,  had  not  died  out.  This  was  especially  true  at  New  Haven, 
where  Mr.  Edward  C.  Herrick  was  distinguished  for  his  indefatigable 
industry  both  in  personal  observation  and  in  the  search  for  records  of 
former  showers.  A  rich  accumulation  of  material  was  thus  awaiting 
development.  In  1861,  the  Connecticut  Academy  of  Arts  and  Sciences 
appointed  a  committee  "to  communicate  with  observers  in  various 
localities  for  combined  and  systematic  observations  upon  the  August 
and  November  meteors."  In  this  committee  Professor  Newton  was 
preeminently  active.  He  entered  zealously  upon  the  work  of  col- 
lecting material  by  personal  observation  and  correspondence  and  by 
organizing  corps  of  observers  of  students  and  others,  and  at  the  same 
time  set  himself  to  utilize  the  material  thus  obtained  by  the  most 
careful  study.  The  value  of  the  observations  collected  was  greatly 
increased  by  a  map  of  the  heavens  for  plotting  meteor-paths,  which 
was  prepared  by  Professor  Newton  and  printed  at  the  expense  of  the 
Connecticut  Academy  for  distribution  among  observers. 

By  these  organized  efforts,  in  a  great  number  of  cases  observations 
were  obtained  on  the  same  meteor  as  seen  from  different  places,  and 
the  actual  path  in  the  atmosphere  was  computed  by  Professor  Newton. 
In  a  paper  published  in  1865  *  the  vertical  height  of  the  beginning 
and  the  end  of  the  visible  part  of  the  path  is  given  for  more  than 
one  hundred  meteors  observed  on  the  nights  of  August  10th  and 
November  13th,  1863.  It  was  shown  that  the  average  height  of  the 
November  meteors  is  fifteen  or  twenty  miles  higher  than  that  of 
the  August  meteors,  the  former  beginning  in  the  mean  at  a  height 
of  ninety-six  miles  and  ending  at  sixty-one,  the  latter  beginning  at 
seventy  and  ending  at  fifty-six. 

We  mention  this  paper  first,  because  it  seems  to  represent  the 
culmination  of  a  line  of  activity  into  which  Professor  Newton  had 
entered  much  earlier.  We  must  go  back  to  consider  other  papers 
which  he  had  published  in  the  meantime. 

His  first  papers  on  this  subject,  1860-62,t  were  principally  devoted 
to  the  determination  of  the  paths  and  velocities  of  certain  brilliant 
meteors  or  fireballs,  which  had  attracted  the  attention  of  observers  in 
different  localities.  Three  of  these  appeared  to  have  velocities  much 
greater  than  is  possible  for  permanent  members  of  the  solar  system. 
To  another  a  particular  interest  attached  as  belonging  to  the  August 

*  Amer.  Jour.  ScL,  «er.  %  vol  xl,  p.  250. 

t  Amer.  Jour.  8ci.t  eer.  2,  voL  xxx,  p.  186 ;  xxxii,  p.  448 ;  and  xxxiii,  p.  338. 
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shower,  although  exceptional  in  size.  For  this  he  calculated  the 
elements  of  the  orbit  which  would  give  the  observed  path  and  velocity. 
But  the  determination  of  the  velocity  in  such  cases,  which  depends 
upon  the  estimation  by  the  observers  of  the  time  of  flight,  is  neces- 
sarily very  uncertain,  and  at  best  affords  only  a  lower  limit  for  the 
value  of  the  original  velocity  of  the  body  before  it  encountered  the 
resistance  of  the  earth's  atmosphere.  This  would  seem  to  constitute  an 
insuperable  difficulty  in  the  determination  of  the  orbits  of  meteoroids, 
to  use  the  term  which  Professor  Newton  applied  to  these  bodies., 
before  they  enter  the  earth's  atmosphere  to  appear  for  a  moment  as 
luminous  meteors.  Yet  it  has  been  completely  overcome  in  the  case 
of  the  November  meteors,  or  Leonids  as  they  are  called  from  the 
constellation  from  which  they  appear  to  radiate.  This  achievement 
constitutes  one  of  the  most  interesting  chapters  in  the  history  of 
meteoric  science,  and  gives  the  subject  an  honorable  place  among  the 
exact  sciences. 

In  the  first  place,  by  a  careful  study  of  the  records,  Professor 
Newton  showed  that  the  connection  of  early  showers  with  those  of 
1799  and  1833  had  been  masked  by  a  progressive  change  in  the  time 
of  the  year  in  which  the  shower  occurs.  This  change  had  amounted 
to  a  full  month  between  aj>.  902,  when  the  shower  occurred  on 
October  13,  and  1833,  when  it  occurred  on  November  13.  It  is  in 
part  due  to  the  precession  of  the  equinoxes,  and  in  part  to  the  motion 
of  the  node  where  the  earth's  orbit  meets  that  of  the  meteoroids. 
This  motion  must  be  attributed  to  the  perturbations  of  the  orbits  of 
the  meteoroids  which  are  produced  by  the  attractions  of  the  planets, 
and  being  in  the  direction  opposite  to  that  of  the  equinoxes,  Professor 
Newton  inferred  that  the  motion  of  the  meteoroids  must  be  retrograde. 

The  showers  do  not,  however,  occur  whenever  the  earth  passes  the 
node,  but  only  when  the  passage  occurs  within  a  year  or  two  before 
or  after  the  termination  of  a  cycle  of  82*25  years.  This  number  is 
obtained  by  dividing  the  interval  between  the  showers  of  902  and 
1833  by  28,  the  number  of  cycles  between  these  dates,  and  must 
therefore  be  a  very  close  approximation.  For  if  these  showers  did 
not  mark  the  precise  end  of  cycles  the  resultant  error  would  be 
divided  by  28.  Professor  Newton  showed  that  this  value  of  the  cycle 
requires  that  the  number  of  revolutions  performed  by  the  meteoroids 
in  one  year  should  be  either  2+g^  or  l±g^  or  g^.  In  other  words, 
the  periodic  time  of  the  meteoroids  must  be  either  180*0  or  185*4  or 
3546  or  376*6  days,  or  33*25  years.  Now  the  velocity  of  any  body 
in  the  solar  system  has  a  simple  relation  to  its  periodic  time  and  its 
distance  from  the  sun.  Assuming,  therefore,  any  one  of  these  five 
values  of  the  periodic  time,  we  have  the  velocities  of  the  Leonids 
at  the  node  very  sharply  determined.    From  this  velocity,  with  the 
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position  of  the  apparent  radiant,  which  gives  the  direction  of  the 
relative  motion,  and  with  the  knowledge  that  the  heliocentric  motion 
is  retrograde,  we  may  easily  determine  the  orbit 

We  have,  therefore,  five  orbits  from  which  to  choose.  The 
calculation  of  the  secular  motion  of  the  node  due  to  the  disturbing 
action  of  the  planets,  would  enable  us  to  decide  between  these  orbits. 

Such  are  the  most  important  conclusions  which  Professor  Newton 
derived  from  the  study  of  these  remarkable  showers,  interesting  not 
only  from  the  magnificence  of  the  spectacle  occasionally  exhibited, 
but  in  a  much  higher  degree  from  the  peculiarity  in  the  periodic 
character  of  their  occurrence,  which  affords  the  means  of  the  deter- 
mination of  the  orbit  of  the  meteoroids  with  a  precision  which  would 
at  first  sight  appear  impossible. 

Professor  Newton  anticipated  a  notable  return  of  the  shower  in 
1866,  with  some  precursors  in  the  years  immediately  preceding,  a 
prediction  which  was  amply  verified.  In  the  meantime  he  turned 
his  attention  to  the  properties  which  belong  to  shooting  stars  in 
general,  and  especially  to  those  average  values  which  relate  to  large 
numbers  of  these  bodies  not  belonging  to  any  particular  swarm. 

This  kind  of  investigation  Maxwell  has  called  statistical,  and  has 
in  more  than  one  passage  signalized  its  difficulties.  The  writer 
recollects  a  passage  of  Maxwell  which  was  pointed  out  to  him  by 
Professor  Newton,  in  which  the  author  says  that  serious  errors  have 
been  made  in  such  inquiries  by  men  whose  competency  in  other 
branches  of  mathematics  was  unquestioned.  Doubtless  Professor 
Newton  was  very  conscious  of  the  necessity  of  caution  in  these 
inquiries,  as  is  indeed  abundantly  evident  from  the  manner  in  which 
he  expressed  his  conclusions;  but  the  writer  is  not  aware  of  any 
passage  in  which  he  has  afforded  an  illustration  of  Maxwell's  remark. 

The  results  of  these  investigations  appeared  in  an  elaborate  memoir 
"On  Shooting  Stars,"  which  was  read  to  the  National  Academy  in 
1864,  and  appeared  two  years  later  in  the  Memoirs  of  the  Academy.9 
An  abstract  was  given  in  the  American  Journal  of  Science  in  1865.t 
The  following  are  some  of  the  subjects  treated,  with  some  of  the  more 
interesting  results : 

The  distribution  of  the  apparent  paths  of  shooting  stars  in  azimuth 
and  altitude. 

The  vertical  distribution  of  the  luminous  part  of  the  real  paths. 
The  value  found  for  the  mean  height  of  the  middle  point  of  the 
luminous  path  was  a  trifle  less  than  sixty  miles. 

The  mean  length  of  apparent  paths. 

The  mean  distance  of  paths  from  the  observer. 


*  Vol.  i,  3d  memoir.  t  Series  2,  toI.  xxxix,  p.  193. 
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The  mean  foreshortening  of  paths. 

The  mean  length  of  the  visible  part  of  the  real  paths. 

The  mean  time  of  flight,  as  estimated  by  observers. 

The  distribution  of  the  orbits  of  meteoroids  in  the  solar  system. 

The  daily  number  of  shooting  stars,  and  the  density  of  the 
meteoroides  in  the  space  which  the  earth  traverses. 

The  average  number  of  shooting  stars  which  enter  the  atmosphere 
daily,  and  which  are  large  enough  to  be  visible  to  the  naked  eye, 
if  the  sun,  moon  and  clouds  would  permit  it,  is  more  than  seven 
and  a  half  millions.  Certain  observations  with  instruments  seem 
to  indicate  that  this  number  should  be  increased  to  more  than 
four  hundred  millions,  to  include  telescopic  shooting  stars,  and  there 
is  no  reason  to  doubt  that  an  increase  of  optical  power  beyond  that 
employed  in  these  observations  would  reveal  still  larger  numbers  of 
these  small  bodies.  In  each  volume  of  the  size  of  the  earth,  of  the 
space  which  the  earth  is  traversing  in  its  orbit  about  the  sun,  there 
are  as  many  as  thirteen  thousand  small  bodies,  each  of  which  is 
such  as  would  furnish  a  shooting  star  visible  under  favorable 
circumstances  to  the  naked  eye. 

These  conclusions  are  certainly  of  a  startling  character,  but  not  of 
greater  interest  than  those  relating  to  the  velocity  of  meteoroids. 
There  are  two  velocities  to  be  considered,  which  are  evidently  con- 
nected, the  velocity  relative  to  the  earth,  and  the  velocity  of  the 
meteoroids  in  the  solar  system.  To  the  latter,  great  interest  attaches 
from  the  fact  that  it  determines  the  nature  of  the  orbit  of  the 
meteoroid.  A  velocity  equal  to  that  of  the  earth,  indicates  an  orbit 
like  that  of  the  earth;  a  velocity  ^/2  times  as  great,  a  parabolic 
orbit  like  that  of  most  comets,  while  a  velocity  greater  than  this 
indicates  a  hyperbolic  orbit. 

Professor  Newton  sought  to  form  an  estimate  of  this  critical 
quantity  in  more  than  one  way.  That  on  which  he  placed  most 
reliance  was  based  on  a  comparison  of  the  numbers  of  shooting  stars 
seen  in  the  different  hours  of  the  night.  It  is  evident  that  in  the 
morning,  when  we  are  in  front  of  the  earth  in  its  motion  about  the 
sun,  we  should  see  more  shooting  stars  than  in  the  evening,  when  we 
are  behind  the  earth ;  but  the  greater  the  velocity  of  the  meteoroids 
compared  with  that  of  the  earth,  the  less  the  difference  would  be  in 
the  numbers  of  evening  and  morning  stars.* 


*  It  may  not  be  oat  of  place  to  notiee  here  an  erratum  which  ooours  both  in  the 
Memoirs  of  the  National  Academy  and  in  the  abstraot  in  the  American  Journal  of 
Science,  and  which  the  writer  finds  marked  in  a  private  oopy  of  Professor  Newton's. 
In  the  table  on  p.  20  of  the  memoir  and  206  of  the  abstraot,  the  column  of  numbers 
under  the  head  "hour  of  the  night"  should  be  inverted.  There  is  another  displace- 
ment in  the  table  in  the  memoir,  which  is,  however,  oorreoted  in  the  abstract. 
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After  a  careful  discussion  of  the  evidence  Professor.  Newton  reached 
the  conclusion  that  "we  must  regard  as  almost  certain  (on  the 
hypothesis  of  an  equable  distribution  of  the  directions  of  absolute 
motions),  that  the  mean  velocity  of  the  meteoroids  exceeds  con- 
siderably that  of  the  earth;  that  the  orbits  are  not  approximately 
circular,  but  resemble  more  the  orbits  of  comets.** 

This  last  sentence,  which  is  taken  from  the  abstract  published  in 
the  American  Jov/rnal  of  Science  in  1865,  and  is  a  little  more 
definitely  and  positively  expressed  than  the  corresponding  passage 
in  the  original  memoir,  indicating  apparently  that  the  author's 
conviction  had  been  growing  more  positive  in  the  interval,  or  at 
least  that  the  importance  of  the  conclusion  had  been  growing  upon 
him,  embodies  what  is  perhaps  the  most  important  result  of  the 
memoir,  and  derives  a  curious  significance  from  the  discoveries  which 
were  to  astonish  astronomers  in  the  immediate  future. 

The  return  of  the  November  or  Leonid  shower  in  1865,  and 
especially  in  1866,  when  the  display  was  very  brilliant  in  Europe, 
gave  an  immense  stimulus  to  meteoric  study,  and  an  especial 
prominence  to  this  group  of  meteoroids.  "Not  since  the  year 
1759,"  says  Schiaparelli,  "when  the  predicted  return  of  a  comet  first 
took  place,  had  the  verified  prediction  of  a  periodic  phenomenon 
made  a  greatei  impression  than  the  magnificent  spectacle  of 
November,  1866.  The  study  of  cosmic  meteors  thereby  gained  the 
dignity  of  a  science,  and  took  finally  an  honorable  place  among 
the  other  branches  of  astronomy ."  *  Professor  J.  C.  Adams,  of 
Cambridge,  England,  then  took  up  the  calculation  of  the  perturba- 
tions determining  the  motion  of  the  node.  We  have  seen  that 
Professor  Newton  had  shown  that  the  periodic  time  was  limited  to 
five  sharply  determined  values,  each  of  which  with  the  other  data 
would  give  an  orbit,  and  that  the  true  orbit  could  be  distinguished 
from  the  other  four  by  the  calculation  of  the  secular  motion  of  the 
node. 

Professor  Adams  first  calculated  the  motion  of  the  node  due  to  the 
attractions  of  Jupiter,  Venus,  and  the  Earth  for  the  orbit  having  a 
period  of  354*6  days.  This  amounted  to  a  little  less  than  12'  in 
33*25  years.  As  Professor  Newton  had  shown  that  the  dates  of  the 
showers  require  a  motion  of  29'  in  3325  years,  the  period  of  354 6 
days  must  be  rejected.  The  case  would  be  nearly  the  same  with  a 
period  of  376*6  days,  while  a  period  of  180  or  185-4  days  would  give 
a  still  smaller  motion  of  the  node.  Hence,  of  the  five  possible  periods 
indicated  by  Professor  Newton,  four  were  shown  to  be  entirely 
incompatible  with  the  motion  of  the  node,  and  it  only  remained  to 


*  Schiaparelli,  Bntwwf  eintr  (utrowmUchen  Theorie  der  Siernschnuppen,,  p.  66 
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examine  whether  the  fifth  period,  viz.  that  of  33*25  years,  would  give 
a  motion  of  the  node  in  accordance  with  the  observed  value.  As 
this  period  gives  a  very  long  ellipse  for  the  orbit,  extending  a  little 
beyond  the  orbit  of  Uranus,  it  was  necessary  to  take  account  of  the 
perturbations  due  to  that  planet  and  to  Saturn.  Professor  Adams 
found  28'  for  the  motion  of  the  node.  As  this  value  must  be  regarded 
as  sensibly  identical  with  Professor  Newton's  29'  of  observed  motion, 
no  doubt  was  left  in  regard  to  the  period  of  revolution  or  the  orbit  of 
the  meteoroids.* 

About  this  time,  M.  Schiaparelli  was  led  by  a  course  of  reasoning 
similar  to  Professor  Newton's  to  the  same  conclusion,-— that  the  mean 
velocity  of  the  meteoroids  is  not  very  different  from  that  due  to 
parabolic  orbits.  In  the  course  of  his  speculations  in  regard  to  the 
manner  in  which  such  bodies  might  enter  the  solar  system,  the 
questions  suggested  themselves :  whether  meteoroids  and  comets  may 
not  have  a  similar  origin;  whether,  in  case  a  swarm  of  meteoroids 
should  include  a  body  of  sufficient  size,  this  would  not  appear  as  a 
comet;  and  whether  some  of  the  known  comets  may  not  belong  to 
streams  of  meteoroids.  Calculating  the  orbit  of  the  Perseids,  or 
August  meteoroids,  from  the  radiant  point,  with  the  assumption  of  a 
nearly  parabolic  velocity,  he  found  an  orbit  very  similar  to  that  of 
the  great  comet  of  1862,  which  may  therefore  be  considered  as  one 
of  the  Perseids, — probably  the  largest  of  them  alLt 

At  that  time  no  known  coraetic  orbit  agreed  with  that  of  the 
Leonids,  but  a  few  months  later,  as  soon  as  the  definitive  elements  of 
the  orbit  of  the  first  comet  of  1866  were  published,  their  resemblance 
to  those  of  the  Leonids,  as  calculated  for  the  period  of  33*25  years, 
which  had  been  proved  to  be  the  correct  value,  was  strikingly 
manifested,  attracting  at  once  the  notice  of  several  astronomers. 

Other  relations  of  the  same  kind  have  been  discovered  later,  of 
which  that  of  Biela's  comet  and  the  Andromeds  is  the  most  interesting, 
as  we  have  seen  the  comet  breaking  up  under  the  influence  of  the 
sun;  but  in  no  case  is  the  coincidence  so  striking  as  in  that  of 
the  Leonids,  since  in  no  other  case  is  the  orbit  of  the  meteoroids 
completely  known,  independently  of  that  of  the  comet,  and  without 
any  arbitrary  assumption  in  regard  to  their  periodic  time. 

The  first  comet  of  1866  is  probably  not  the  only  one  belonging  to 
the  Leonid  stream  of  meteoroids.  Professor  Newton  has  remarked 
that  the  Chinese  annals  mention  two  comets  which  passed  rapidly  in 
succession  across  the  sky  in  1366,  a  few  days  after  the  passage  of  the 
earth  through  the  node  of  the  Leonid  stream,  which  was  marked  in 
Europe  by  one  of  the  most  remarkable  star-showers  on  record.    The 


*  Monthly  Notices  Soy.  Ast.  8oct  voL  xxvii,  p.  247.  t  Bntwwf,  etc,  pp.  49-54. 
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course  of  these  comets,  as  described  by  the  annalists,  was  in  the  line 
of  the  Leonid  stream.* 

This  identification  of  comets  with  meteors  or  shooting-stars  marks 
an  epoch  in  the  study  of  the  latter.  Henceforth,  they  must  be 
studied  in  connection  with  comets.  It  was  presumably  this  discovery 
which  led  Professor  Newton  to  those  statistical  investigations 
respecting  comets,  which  we  shall  presently  consider.  At  this  point, 
however,  at  the  close  as  it  were  of  the  first  chapter  in  the  history  of 
meteoric  science,  it  seems  not  unfitting  to  quote  the  words  of  an 
eminent  foreign  astronomer,  written  about  this  time,  in  regard  to 
Professor  Newton's  contributions  to  this  subject.  In  an  elaborate 
memoir  in  the  Comptea  Rendus,  M.  Faye  says,  with  reference  to  our 
knowledge  of  shooting-stars  and  their  orbits,  "we  may  find  in  the 
works  of  M.  Newton,  of  the  United  States,  the  most  advanced 
expression  of  the  state  of  science  on  this  subject,  and  even  the  germ, 
I  think,  of  the  very  remarkable  ideas  brought  forward  in  these  last 
days  by  M.  Schiaparelli  and  M.  Le  Verrier."  t 

The  first  fruit  of  Professor  Newton's  statistical  studies  on  comets 
appeared  in  1878  in  a  paper  "On  the  Origin  of  Comets."  In  this 
paper  he  considers  the  distribution  in  the  solar  system  of  the  known 
cometic  orbits,  and  compares  it  with  what  we  might  expect  on  either 
of  two  hypotheses :  that  of  Kant,  that  the  comets  were  formed  in  the 
evolution  of  the  solar  system  from  the  more  distant  portion  of  the 
solar  nebula;  and  that  of  Laplace,  that  the  comets  have  come  from  the 
stellar  spaces  and  in  their  origin  had  no  relation  to  the  solar  system. 

In  regard  to  the  distribution  of  the  aphelia,  he  shows  that,  except 
so  far  as  modified  by  the  perturbations  due  to  the  planets,  the  theory 
of  internal  origin  would  require  all  the  aphelia  to  be  in  the  vicinity 
of  the  ecliptic.;  the  theory  of  external  origin  would  make  all  direc- 
tions of  the  aphelia  equally  probable,  i.e.,  the  distribution  in  latitude 
of  the  aphelia  should  be  that  in  which  the  frequency  is  as  the  cosine 
of  the  latitude.  The  actual  distribution  comes  very  near  to  this,  but 
as  the  effect  of  perturbations  would  tend  to  equalize  the  distribution 
of  aphelia  in  all  directions,  Professor  Newton  does  not  regard  this 
argument  as  entirely  deciaiva  He  remarks,  however,  that  if  Kant's 
hypothesis  be  true,  the  comets  must  have  been  revolving  in  their 
orbits  a  very  long  time,  and  the  process  of  the  disintegration  of  comets 
must  be  very  slow. 

In  regard  to  the  distribution  of  the  orbits  in  inclination,  the  author 
shows  that  the  theory  of  internal  origin  would  make  all  inclinations 
equally  probable;    the  theory  of  external   origin   would  make  all 

•  Amer.  Jour.  Sci.,  uer.  2,  vol.  xliii,  p.  298,  and  vol.  xlv,  p.  91,  or  Encycl.  Briton*., 
article  Meteor, 
t  Comptt*  Rendu*,  t.  Ixiv,  p.  551. 
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directions  of  the  normal  to  the  plane  of  the  orbit  equally  probable. 
On  the  first  hypothesis,  therefore,  we  should  expect  a  uniform 
distribution  in  inclination;  on  the  second,  a  frequency  proportioned 
to  the  sine  of  the  inclination.  It  was  shown  by  a  diagram  in  which 
the  actual  and  the  two  theoretical  distributions  are  represented 
graphically,  that  the  actual  distribution  agrees  pretty  well  with 
the  theory  of  external  origin  and  not  at  all  with  that  of  internal 
origin.  It  was  also  shown  that  the  curve  of  actual  distribution 
cannot  be  made  to  agree  with  Kant's  hypothesis  by  any  simple 
and  reasonable  allowances  for  perturbations.  On  the  other  hand, 
if  we  assume  the  external  origin  of  comets,  and  ask  how  the  curve 
of  sines  must  be  modified  in  order  to  take  account  of  perturbations, 
it  is  shown  that  the  principal  effect  will  be  to  increase  somewhat 
the  number  of  inclinations  between  90°  and  185°  at  the  expense  of 
those  between  45°  and  90°.  It  is  apparent  at  once  from  the  diagram 
that  such  a  change  would  make  a  very  good  agreement  between  the 
actual  and  theoretical  curves,  the  only  important  difference  remaining 
being  due  to  comets  of  short  periods,  which  mostly  have  small 
inclinations  with  direct  motion.  These  should  not  weigh  very  much, 
the  author  observes,  in  the  general  question  of  the  distribution  of 
inclinations,  because  they  return  so  frequently  and  are  so  easily 
detected  that  their  number  in  a  list  of  observed  comets  is  out  of  all 
proportion  to  their  number  among  existing  comets.  But  this  group 
of  comets  of  short  periods  can  easily  be  explained  on  the  theory  of  an 
external  origin.  For  such  comets  must  have  lost  a  large  part  of 
their  velocity  by  the  influence  of  a  planet  This  is  only4ikely  to 
happen  when  a  comet  overtakes  the  planet  and  passes  in  front  of  it 
This  implies  that  its  original  motion  was  direct  and  in  an  orbit  of 
small  inclination  to  that  of  the  planets,  and  although  it  may  lose  a 
large  part  of  its  velocity,  its  motion  will  generally  remain  direct  and 
in  a  plane  of  small  inclination.  This  very  interesting  case  of  the 
comets  of  short  periods  and  small  inclinations,  which  was  treated 
rather  briefly  in  this  paper,  was  discussed  more  fully  by  Professor 
Newton  at  the  meeting  of  the  British  Association  in  the  following 
year.* 

Many  years  later,  Professor  Newton  returned  to  the  same  general 
subject  in  a  very  interesting  memoir  "  On  the  Capture  of  Comets  by 
Planets;  especially  their  Capture  by  Jupiter,"  which  was  read  before 
the  National  Academy  in  1891,  and  appeared  in  the  Memoirs  of  the 
Academy  two  years  later,  t  It  also  appeared  in  the  American 
Jowrnal  of  Science  in  the  year  in  which  it  was  read.  J     This  contains 

*  Bep't  BriL  Abooc  Adv.  ScL  for  1879,  p.  272. 

fifem.  Nat.  Acad.,  voL  vi,  1st  memoir. 

XAmer.  Jour.  ScLt  ser.  8,  voL  xlii,  pp.  183  and  482. 
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the  results  of  careful  statistical  calculations  on  the  effect  of  per- 
turbations on  orbits  of  comets  originally  parabolic.  It  corroborates 
the  more  general  statements  of  the  paper  "  On  the  Origin  of  Comets," 
giving  them  a  precise  quantitative  form.  One  or  two  quotations 
will  give  some  idea  of  the  nature  of  this  very  elaborate  and  curious 
memoir,  in  which,  however,  the  results  are  largely  presented  in  the 
form  of  diagrams. 

On  a  certain  hypothesis  regarding  an  original  equable  distribution 
of  comets  in  parabolic  orbits  about  the  sun,  it  is  shown  that  "if  in 
a  given  period  of  time  a  thousand  million  comets  come  in  parabolic 
orbits  nearer  to  the  sun  than  Jupiter,  126  of  them  will  have  their 
orbits  changed"  by  the  action  of  that  planet  "into  ellipses  with 
periodic  times  less  than  one-half  that  of  Jupiter;  889  of  them  will 
have  their  orbits  changed  into  ellipses  with  periodic  times  less  than 
that  of  Jupiter;  1701  of  them  will  have  their  orbits  changed  into 
ellipses  with  periodic  times  less  than  once  and  a  half  that  of  Jupiter, 
and  2670  of  them  will  have  their  orbits  changed  into  ellipses  with 
periodic  times  less  than  twice  that  of  Jupiter."  A  little  later,  Pro- 
fessor Newton  considers  the  question,  which  he  characterises  as 
perhaps  more  important,  of  the  direct  or  retrograde  motion  of  the 
comets  after  such  perturbations.  It  is  shown  that  of  the  839  comets 
which  have  periodic  times  less  than  Jupiter,  203  will  have  retrograde 
motions,  and  636  will  have  direct  motions.  Of  the  203  with  retro- 
grade motion,  and  of  the  636  with  direct  motion,  51  and  257,  respec- 
tively, will  have  orbits  inclined  less  than  30°  to  that  of  Jupiter. 

We  have  seen  that  the  earliest  of  Professor  Newton's  more  important 
studies  on  meteors  related  to  the  Leonids,  which  at  that  time  far 
surpassed  all  other  meteoric  streams  in  interest  One  of  his  later 
studies  related  to  another  stream  which  in  the  mean  time  had  acquired 
great  importance.  The  identification  of  the  orbit  of  the  Andromed 
meteors  with  that  of  Biela's  comet,  which  we  have  already  mentioned, 
gave  these  bodies  a  unique  interest,  as  the  comet  had  been  seen  to 
break  up  under  the  influence  of  the  sun.  Here  the  evolution  of 
meteoroids  was  taking  place  before  our  eyes;  and  this  interest  was 
heightened  by  the  showers  of  1872  and  1885,  which  in  Europe  seem 
to  have  been  unsurpassed  in  brilliancy  by  any  which  have  occurred 
in  this  century. 

The  phenomena  of  each  of  these  showers  were  carefully  discussed 
by  Professor  Newton.  Among  the  principal  results  of  his  paper  on 
the  latter  shower  are  the  following  :* 

The  time  of  the  maximum  frequency  of  meteors  was  Nov.  27, 1885 
6  h.  15  m.  Gr.  m.  t    The  estimated  number  per  hour  visible  at  one 


*  Amer.  Jowr.  8cLt  ser.  3,  voL  xxxi,  p.  409. 
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place  was  then  75,000.  This  gives  a  density  of  the  meteoroids  in 
space  represented  by  one  to  a  cube  of  twenty  miles  edge.  Three 
hours  later  the  frequency  had  fallen  to  one-tenth  of  the  maximum 
value.  The  really  dense  portion  of  the  stream  through  which  we 
passed  was  less  than  100,000  miles  in  thickness,  and  nearly  all  would 
be  included  in  a  thickness  of  200,000  miles. 

A  formula  is  given  to  express  the  effect  of  the  earth's  attraction 
on  the  approaching  meteoroids  in  altering  the  position  of  the  radiant. 
This  is  technically  known  as  the  zenithal  attraction,  and  is  quite 
important  in  the  case  of  these  meteors  on  account  of  their  small 
relative  velocity.  The  significance  of  the  formula  may  be  roughly 
expressed  by  saying  that  the  earth's  attraction  changes  the  radiant  of 
the  Biela  meteors,  toward  the  vertical  of  the  observer,  one-tenth  of 
the  observed  zenith  distance  of  the  radiant,.or  more  briefly,  that  the 
zenithal  attraction  for  these  meteors  is  one-tenth  of  the  observed 
zenith  distance.  The  radiant  even  after  the  correction  for  zenithal 
attraction,  and  another  for  the  rotation  of  the  earth  on  its  axis,  is  not 
a  point  but  an  area  of  several  degrees  diameter.  The  same  has  been 
observed  in  regard  to  other  showers,  but  the  result  comes  out  more 
distinctly  in  the  present  case  because  the  meteors  were  so  numerous 
and  the  shower  so  carefully  observed. 

This  implies  a  want  of  parallelism  in  the  paths  of  the  meteors,  and 
it  is  a  very  important  question  whether  it  exists  before  the  meteoroids 
enter  our  atmosphere,  or  whether  it  is  due  to  the  action  of  the 
atmosphere. 

Professor  Newton  shows  that  it  is  difficult  to  account  for  so  large 
a  difference  in  the  original  motions  of  the  meteoroids,  and  thinks  it 
reasonable  to  attribute  a  large  part  of  the  want  of  parallelism  to  the 
action  of  the  atmosphere  on  bodies  of  an  irregular  form,  such  as  we 
have  every  reason  to  believe  that  the  meteoroids  have,  when  they 
enter  our  atmosphere.  The  effect  of  the  heat  generated  will  be  to 
round  off  the  edges  and  prominent  parts,  and  to  reduce  the  meteor 
to  a  form  more  and  more  spherical.  It  is,  therefore,  quite  natural 
that  the  greater  portion  of  the  curvature  of  the  paths  should  be 
in  the  invisible  portion  and  thus  escape  our  notice.  It  is  only  in 
exceptional  cases  that  the  visible  path  is  notably  curved. 

But  the  great  interest  of  the  paper  centers  in  his  discussion  of  the 
relation  of  this  shower  to  preceding  showers,  and  to  the  orbit  of 
Biela's  comet.  The  changes  in  the  date  of  the  shower  (from  Dec  6 
to  Nov.  27)  and  in  the  position  of  the  radiant  are  shown  to  be  related 
to  the  great  perturbations  of  Biela's  comet  in  1794, 1831,  and  1841-2. 
The  showers  observed  by  Brandes,  Dec  6th,  1798,  by  Herrick,  Dec  7th, 
1888,  and  by  Heis,  Dec  8th  and  10th,  1847,  are  related  to  the  orbit 
of  Biela's  comet  as  it  was  in  1772 ;  while  the  great  showers  of  1872 
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and  1885,  as  well  as  a  trifling  display  in  1867,  are  related  to  the 
orbit  of  1852.* 

Assuming,  then,  that  the  meteoroids  which  we  met  on  the  27th  of 
November,  1872,  did  not  leave  the  immediate  neighborhood  of  the 
Biela  comet  before  1841-2,  we  seem  to  have  the  data  for  a  very 
precise  determination  of  their  orbit  between  those  dates.  The  same 
is  true  of  those  which  we  met  in  1885.  The  computation  of  these 
orbits,  the  author  remarks,  may  possibly  give  evidence  for  or  against 
the  existence  of  a  resisting  medium  in  the  solar  system. 

In  his  last  public  utterance  on  the  subject  of  meteors,  which  was 
on  the  occasion  of  the  recent  sesquicentennial  celebration  of  the 
American  Philosophical  Society,  Professor  Newton  returns  to  the  Biela 
meteoroids,  and  finds  in  the  scattering  which  they  show  in  the  plane 
of  their  orbit  the  proof  of  a  disturbing  force  in  that  plane,  and  there- 
fore not  due  to  the  planets.  The  force  exerted  by  the  sun  appears  to 
be  modified  somewhat  as  we  see  it  in  the  comet's  tails,  where  indeed 
the  attraction  is  changed  into  a  repulsion.  Something  of  the  same 
sort  on  a  smaller  scale  relatively  to  the  mass  of  the  bodies  appears  to 
modify  the  sun's  action  on  the  meteoroids. 

In  1888  Professor  Newton  read  a  paper  before  the  National 
Academy  "Upon  the  relation  which  the  former  Orbits  of  those 
Meteorites  that  are  in  our  collections,  and  that  were  seen  to  fall, 
had  to  the  Earth's  Orbit."  This  was  based  upon  a  very  careful  study 
of  more  than  116  cases  for  which  we  have  statements  indicating  more 
or  less  definitely  the  direction  of  the  path  through  the  air,  as  well  as 
94  cases  in  which  we  only  know  the  time  of  the  fall.  The  results  are 
expressed  in  the  following  three  propositions : 

1.  The  meteorites  which  we  have  in  our  cabinets  and  which  were 
seen  to  fall  were  originally  (as  a  class,  and  with  a  very  small 
number  of  exceptions)  moving  about  the  sun  in  orbits  that  had 
inclinations  less  than  90°;  that  is,  their  motions  were  direct,  not 
retrograde. 

2.  The  reason  why  we  have  only  this  class  of  stones  in  our  collec- 
tions is  not  one  wholly  or  even  mainly  dependent  on  the  habits  of 
men ;  nor  on  the  times  when  men  are  out  of  doors ;  nor  on  the  places 
where  men  live ;  nor  on  any  other  principle  of  selection  acting  at  or 
after  the  arrival  of  the  stones  at  the  ground.  Either  the  stones 
which  are  moving  in  the  solar  system  across  the  earth's  orbit  move 
in  general  in  direct  orbits ;  or  else  for  some  reason  the  stones  which 


*  It  is  a  curious  coincidence  that  the  original  discoverer  of  the  Deoember  shower  as 
a  periodic  phenomenon,  Mr.  Edward  C.  Herrick,  should  have  been  (with  a  companion, 
Mr.  Francis  Bradley,)  the  first  to  observe  that  breaking  up  of  the  parent  body  whioh 
was  destined  to  reinforce  the  meteorio  stream  in  so  remarkable  a  manner.  See  Amer, 
Jour.  Set.,  ser.  3,  vol.  xxzi,  pp.  86  and  88. 
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move  in  retrograde  orbits  do  not  in  general  come  through  the  air  to 
the  ground  in  solid  form. 

3.  The  perihelion  distances  of  nearly  all  the  orbits  in  which  these 
stones  moved  were  not  less  than  0*5  nor  more  than  1*0,  the  earth's 
radius  vector  being  unity. 

Professor  Newton  adds,  that  it  seems  a  natural  and  proper  corollary 
to  these  propositions  (unless  it  shall  appear  that  stones  meeting  the 
earth  are  destroyed  in  the  air)  that  the  larger  meteorites  moving  in 
our  solar  system  are  allied  much  more  closely  with  the  group  of 
comets  of  short  period  than  with  comets  whose  orbits  are  nearly 
parabolic.  All  the  known  comets  of  shorter  periods  than  83  years 
move  about  the  sun  in  direct  orbits  that  have  moderate  inclinations 
to  the  ecliptic.  On  the  contrary,  of  the  nearly  parabolic  orbits  that 
are  known  only  a  small  proportion  of  the  whole  number  have  small 
inclinations  with  direct  motion. 

We  have  briefly  mentioned  those  papers  which  seem  to  constitute 
the  most  important  contributions  to  the  science  of  meteors  and  comets. 
To  fully  appreciate  Professor  Newton's  activity  in  this  field,  it  would 
be  necessary  to  take  account  of  his  minor  contributions.* 

Most  interesting  and  instructive  to  the  general  reader  are  his 
utterances  on  occasions  when  he  has  given  a  resume*  of  our  knowledge 
on  these  subjects  or  some  branch  of  them,  as  in  the  address  "  On  the 
Meteorites,  the  Meteors,  and  the  Shooting  Stars,"  which  he  delivered 
in  1886  as  retiring  president  of  the  American  Association  for  the 
Advancement  of  Science,  or  in  certain  lectures  in  the  public  courses 
of  the  Sheffield  Scientific  School  of  Yale  University,  entitled  "The 
story  of  Biela's  Comet"  (1874),  "The  relation  of  Meteorites  to 
Comets"  (1876),  "The  Worship  of  Meteorites"  (1889),  or  in  the 
articles  on  Meteors  in  the  Encyclopoedia  Britannioa  and  Johnson's 
Cyclopaedia. 

If  we  ask  what  traits  of  mind  and  character  are  indicated  by 
these  papers,  the  answer  is  not  difficult.  Professor  Klein  has  divided 
mathematical  minds  into  three  leading  classes:  the  logicians,  whose 
pleasure  and  power  lies  in  subtility  of  definition  and  dialectic  skill ; 
the  geometers,  whose  power  lies  in  the  use  of  the  space-intuitions; 
and  the  formalists,  who  seek  to  find  an  algorithm  for  every  operation.t 
Professor  Newton  evidently  belonged  to  the  second  of  these  classes, 
and  his  natural  tastes  seem  to  have  found  an  equal  gratification 
in  the  development  of  a  system  of  abstract  geometric  truths,  or 


*  These  were  detailed  in  a  bibliography  annexed  to  this  paper  in  Amer.  Jomr.  8d.t 
Mr.  4,  toI.  iii. 
t  Lactones  on  Mathematics  (Bvaneton),  p.  2. 
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in  the  investigation  of  the  concrete  phenomena  of  nature  as  they 
exist  in  space  and  time. 

Bat  these  papers  show  more  than  the  type  of  mind  of  the  anther ; 
they  give  no  uncertain  testimony  concerning  the  character  of  the 
man.  In  all  these  papers  we  see  a  love  of  honest  work,  an  aversion 
to  shams,  a  distrust  of  rash  generalizations  and  speculations  baaed 
on  unoertain  premises.  He  was  never  anxious  to  add  one  more 
guess  on  doubtful  matters  in  the  hope  of  hitting  the  truth,  or  what 
might  pass  as  such  for  a  time,  but  was  always  willing  to  take 
infinite  pains  in  the  most  careful  test  of  every  theory.  To  these 
qualities  was  joined  a  modesty  which  forbade  the  pushing  of  his 
own  claims,  and  desired  no  reputation  except  the  unsought  tribute 
ol  competent  judges.  At  the  close  of  his  article  on  meteors  in  the 
Encyclopaedia  BrUounnica,  which  has  not  the  least  reference  to  himself 
as  a  contributor  to  the  science,  he  remarks  that  "meteoric  science 
is  a  structure  built  stone  by  stone  by  many  builders."  We  may 
add  that  no  one  has  done  more  than  himself  to  establish  the 
foundations  of  the  science,  and  that  the  stones  which  he  has  laid 
are  not  likely  to  need  relaying. 

The  value  of  Professor  Newton's  work  has  been  recognised  by 
learned  societies  and  institutions  both  at  home  and  abroad.  He 
received  the  honorary  degree  of  Doctor  of  Laws  from  the  University 
of  Michigan  in  1868.  He  was  president  of  the  section  of  Mathematics 
and  Astronomy  in  the  American  Association  for  the  Advancement 
in  Science  in  1875,  and  president  of  the  Association  in  1885.  On 
the  first  occasion  he  delivered  an  address  entitled  "A  plea  for  the 
study  of  pure  Mathematics";  on  the  second  the  address  on  Meteorites, 
eta,  which  we  have  already  mentioned.  Of  the  American  Mathe- 
matical Society  he  was  vice-president  at  the  time  of  his  death. 
In  1888  the  J.  Lawrence  Smith  gold  medal  was  awarded  to  him 
by  the  National  Academy  for  his  investigations  on  the  orbits  of 
meteoroids.  We  may  quote  a  sentence  or  two  from  his  reply  to 
the  address  of  presentation,  so  characteristic  are  they  of  the  man 
that  uttered  them:  "To  discover  some  new  truth  in  nature,"  he 
said,  "  even  though  it  concerns  the  small  things  in  the  world,  gives 
one  of  the  purest  pleasures  in  human  experience.  It  gives  joy  to 
tell  others  of  the  treasure  found." 

Besides  the  various  learned  societies  in  our  own  country  of  which 
he  was  a  member,  including  the  American  Academy  of  Arts  and 
Sciences  from  1862,  the  National  Academy  of  Sciences  from  its 
foundation  in  1863,  the  American  Philosophical  Society  from  1867, 
he  was  elected  in  1872  Associate  of  the  Royal  Astronomical  Society 
of  London,  in  1886  Foreign  Fellow  of  the  Royal  Society  of  Edinburgh, 
and  in  1892  Foreign  Member  of  the  Royal  Society  of  London. 
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But  the  studies  which  have  won  for  their  author  an  honorable 
reputation  among  men  of  science  of  all  countries,  form  only  one 
side  of  the  life  of  the  man  whom  we  are  considering.  Another 
side,  probably  the  most  important,  is  that  in  which  he  was  identified 
with  the  organic  life  of  the  College  and  University  with  which 
he  had  been  connected  from  a  very  early  age.  In  fact,  we  might 
almost  call  the  studies  which  we  have  been  considering,  the  recreations 
of  a  busy  life  of  one  whose  serious  occupation  has  been  that  of  an 
instructor.  If  from  all  those  who  have  come  under  his  instruction 
we  should  seek  to  learn  their  personal  recollections  of  Professor 
Newton,  we  should  probably  find  that  the  most  universal  impression 
made  on  his  students  was  his  enthusiastic  love  of  the  subject  which 
he  was  teaching. 

A  department  of  the  University  in  which  he  took  an  especial 
interest  was  the  Observatory.  This  was  placed  under  his  direction 
at  its  organisation,  and  although  he  subsequently  resigned  the 
nominal  directorship,  the  institution  remained  virtually  under  his 
charge,  and  may  be  said  to  owe  its  existence  in  large  measure 
to  his  untiring  efforts  and  personal  sacrifice  in  its  behalf. 

One  sphere  of  activity  in  the  Observatory  was  suggested  by  a 
happy  accident  which  Professor  Newton  has  described  in  the  American 
Journal  of  Science,  September,  1893.  An  amateur  astronomer  in 
a  neighboring  town,  Mr.  John  Lewis,  accidentally  obtained  on  a 
stellar  photograph  the  track  of  a  large  meteor.  He  announced  in 
the  newspapers  that  he  had  secured  such  a  photograph,  and  requested 
observations  from  those  who  had  seen  its  flight  The  photographic 
plate  and  the  letters  received  from  various  observers  were  placed  in 
Professor  Newton's  hands,  and  were  discussed  in  the  paper  mentioned. 
The  advantages  of  photographic  observations  were  so  conspicuous 
that  Professor  Newton  was  anxious  that  the  Observatory  should 
employ  this  method  of  securing  the  tracks  of  meteors.  With  the 
aid  of  an  appropriation  granted  by  the  National  Academy  from  the 
income  of  the  J.  Lawrence  Smith  fund,  a  battery  of  cameras  was 
mounted  on  an  equatorial  axis.  By  this  means,  a  number  of  meteor- 
tracks  have  been  obtained  of  the  August  meteors,  and  in  one  case, 
through  a  simultaneous  observation  by  Mr.  Lewis  in  Ansonia,  Professor 
Newton  was  able  to  calculate  the  course  of  the  meteor  in  the  atmo- 
sphere with  a  probable  error  which  he  estimated  at  less  than  a  mile. 
The  results  which  may  be  expected  at  the  now  near  return  of  the 
Leonids,  will  be  of  especial  interest,  but  it  will  be  for  others  to  utilize 
them. 

Professor  Newton  was  much  interested  in  the  collection  of  meteorites, 
and  the  fine  collection  of  stones  and  irons  in  the  Peabody  Museum 
of  Yale.  University  owes  much  to  his  efforts  in  this  direction. 
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Professor  Newton  was  a  member  of  the  American  Metrological 
Society  from  the  first,  and  was  conspicuously  active  in  the  agitation 
which  resulted  in  the  enactment  of  the  law  of  1866,  legalizing  the 
use  of  the  metric  system.  He  prepared  the  table  of  the  metric 
equivalents  of  the  customary  units  of  weights  and  measures  which 
was  incorporated  in  the  act,  and  by  which  the  relations  of 
the  fundamental  units  were  defined.  But  he  did  not  stop  here. 
Appreciating  the  weakness  of  legislative  enactment  compared  with 
popular  sentiment,  and  feeling  that  the  real  battle  was  to  be  won 
in  familiarizing  the  people  with  the  metric  system,  he  took  pains  to 
interest  the  makers  of  scales  and  rulers  and  other  devices  for  measure- 
ment in  adopting  the  units  and  graduations  of  the  metric  system, 
and  to  have  the  proper  tables  introduced  into  school  arithmetics. 

He  was  also  an  active  member  of  the  Connecticut  Academy  of 
Arts  and  Sciences,  serving  several  years  both  as  secretary  and 
president, — also  as  member  of  the  council  He  was  associate  editor 
of  the  American  Journal  of  Science  from  1864,  having  especial 
charge  of  the  department  of  astronomy.  His  notes  on  observations 
of  meteors  and  on  the  progress  of  meteoric  science,  often  very  brief, 
sometimes  more  extended,  but  always  well  considered,  were  especially 
valuable. 

In  spite  of  his  studious  tastes  and  love  of  a  quiet  life,  he  did  not 
shirk  the  duties  of  citizenship,  serving  a  term  as  alderman  in  the 
city  council,  being  elected,  we  may  observe,  in  a  ward  of  polities 
strongly  opposed  to  his  own. 

Professor  Newton  married,  April  14th,  1859,  Anna  C,  daughter 
of  the  Rev.  Joseph  C.  Stiles,  D.D.,  of  Georgia,  at  one  time  pastor 
of  the  Mercer  Street  Presbyterian  Church  in  New  York  City,  and 
subsequently  of  the  South  Church  in  New  Haven.  She  survived 
her  husband  but  three  months,  leaving  two  daughters. 

In  all  these  relations  of  life,  the  subject  of  this  sketch  exhibited 
the  same  traits  of  character  which  are  seen  in  his  published  papers, 
the  same  modesty,  the  same  conscientiousness,  the  same  devotion 
to  high  ideals.  His  life  was  the  quiet  life  of  the  scholar,  ennobled 
by  the  unselfish  aims  of  the  Christian  gentleman;  his  memory  will 
be  cherished  by  many  friends;  and  so  long  as  astronomers,  while 
they  watch  the  return  of  the  Leonids  marking  off  the  passage  of 
the  centuries,  shall  care  to  turn  the  earlier  pages  of  this  branch 
of  astronomy,  his  name  will  have  an  honorable  place  in  the  history  of 
the  science. 
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